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MPR EFFLUX PUMPS 

Background of the Invention 

5 Most bacterial cells express proteins that function as multidrug 

resistance (MDR) efflux pumps, so named because they confer resistance to a 
large variety of chemically unrelated agents by extruding them from the cell. 
MDR efflux pumps have been described in phylogenetically diverse organisms 
including bacteria (Nikaido, J. Bacteriol. 178:5853-5859, 1996), yeast 

10 (Kolaczkowski et al., Microb. Drug Resist 41:143-158, 1998), and mammals. 
Many MDR efflux pumps are members of multigene families, the best- 
described of which being the gene families encoding the ATP Binding 
Cassette (ABC) membrane proteins, the Multiple Facilitator (MF) superfamily, 
and the small multidrug resistance (Smr) family (Michaelis and Berkower, 

15 Cold Spring Harb. Symp. Quant. Biol. 60:291-307, 1995; Paulsen et al., 

Microbiol. Rev. 60:575-608, 1996). MDR efflux pumps have been shown to 
contribute significantly to resistance to antibacterial agents in a number of 
model organisms as well as important Gram-positive pathogens, including 
Staphylococcus (S.) aureus (Yoshida et al., J. Bacteriol. 172:6942-6949, 1990; 

20 Hsieh et al., Proc. Natl. Acad. Sci. USA 95:6602-6606, 1998), Enterococcus 
(E.)faecalis (Lynch et al., Antimicrob. Agents Chemother. 41:869-871, 1997), 
and Streptococcus (5.) pneumoniae (Gill et. al., Antimicrob. Agents 
Chemother. 43: 187-189, 1999). 

The determination of the complete DNA sequence of microbial 

25 genomes has progressed markedly over the last few years. Model organisms 
continue to lead the way with the reported sequence of Saccharomyces (S.) 
cerevisiae (Goffeau et al., Nature 387:1-105, 1997), Escherichia (£.) coli 
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(Blattner et al., Science 277: 1453-1474, 1997), Bacillus (B.) subtilis (Kunst et 
al., Nature 390:249-256, 1997) and Treponema (T.) pallidum (Fraser et al., 
Science 281: 375-388, 1998). Integration of the genetic and biochemical 
information obtained over decades of research with the newly obtained raw 
5 genomic sequence presents an exciting challenge to biologists and 

bioinformaticists. Many open reading frames (ORFs) have been identified and 
catalogued based on homology with known genes, or known folding domains, 
using algorithms such as FASTA and BLAST (reviewed in Storms, 
"Genome-wide strategies for studying gene function by using model systems." 

10 In: Organization of the Prokaryotic Genome, pp. 347-365. Edited by R.L. 
Charlebois. ASM Press, Washington, D.C. 1999). Significant efforts have 
been made to use homology relationships between ORFs to establish 
hypothetical phylogenetic relationships between organisms as well as 
genealogy of multigene families within any given organism. Two multigene 

15 families that have been examined in detail are those encoding the ABC and 
MF families of proteins. For example, the complete genome of B. subtilis led 
to the identification of no less than 59 possible ABC transporters (Quentin et 
al., J. Mol. Biol. 287:467-484, 1999). Likewise, analysis of 5885 predicted 
ORFs in the complete S. cerevisiae genome identified 186 potential MF 

20 proteins (Nelissen et al., FEMS Microbiol. Rev. 21:113-134, 1997). 

Summary of the Invention 

In a first aspect, the invention features a method of determining 
whether a nucleotide sequence encodes an MDR efflux pump. The method 
includes the steps of: (a) searching a database of nucleotide sequences for 
25 sequences having high identity to a sequence encoding a known MDR efflux 
pump to generate a first set of candidate sequences comprising sequences that 
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have high identity to the sequence encoding the known MDR efflux pump; (b) 
from the first set of candidate sequences, selecting the sequences that include a 
sequence encoding potential transmembrane domains to generate a second set 
of candidate sequences; (c) in a bacterial cell, mutating the gene corresponding 
5 to one of the candidate sequences; and (d) determining whether the bacterial 
cell exhibits increased sensitivity to antibacterial agents, wherein the candidate 
sequence encodes an MDR efflux pump if the bacterial cell exhibits increased 
sensitivity to the antibacterial agents. 

In one embodiment, the sequence encoding the known MDR efflux 

10 pump is a nucleotide sequence, and the searching step (a) includes comparing 
the database of nucleotide sequences with the nucleotide sequence encoding 
the known MDR efflux pump. 

In another embodiment, the sequence is a polypeptide sequence, and 
the searching step (a) includes comparing nucleotide sequences of the database 

15 translated into all six reading frames, to the polypeptide sequence of the known 
MDR efflux pump. 

In a related aspect, the invention features a method of determining 
whether a polypeptide functions as an MDR efflux pump. This method 
includes the steps of: (a) searching a database of polypeptide sequences for 

20 sequences having high identity to a polypeptide sequence that functions as an 
MDR efflux pump to generate a first set of candidate sequences; (b) from the 
first set of candidate sequences, selecting sequences that have potential 
transmembrane domains to generate a second set of candidate sequences; (c) in 
a bacterial cell, mutating one of the candidate sequences ; and (d) determining 

25 whether the bacterial cell exhibits increased sensitivity to antibacterial agents, 
wherein the candidate polypeptide functions as an MDR efflux pump if the cell 
exhibits increased sensitivity to an antibacterial agent. 
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In another aspect, the invention features a method for deleting a 
desired region of DNA in a bacterial cell. The method includes the steps of: 
(a) transforming bacterial cells with a vector that includes (i) a first region of at 
least 30 nucleotides substantially identical to a first region of chromosomal 
5 DNA in the bacterial cells; (ii) a second region of at least 30 nucleotides 

substantially identical to a second region of chromosomal DNA in the bacterial 
cells; and (iii) a third region encoding a polypeptide that provides resistance to 
a selection agent (e.g., kanamycin), wherein the first and second regions of 
chromosomal DNA are on opposite sides of the target region of DNA to be 

10 deleted; (b) selecting for bacterial cells in which the vector of step (a) has 

integrated by a first crossover event into the chromosomal DNA by adding the 
selection agent to the culture medium; (c) culturing the cells selected for in 
step (b) in the absence of the selection agent to allow for a second crossover 
event to occur in at least a subset of the cells, wherein the second crossover 

15 event results in the loss of (i) the desired region of DNA and (ii) the region 
encoding the polypeptide that provides resistance to the selection agent in that 
subset of cells. 

The foregoing method can be repeated one or more times, if desired, 
to delete additional regions of DNA in the same bacterial cell from which the 

20 first region was deleted by repealing steps (a) - (c). Accordingly, the invention 
also features a method for making a multigene mutant bacterial cell. The 
method can be performed in any bacterial cell that can undergo homologous 
recombination, including, for example, Staphylococcus aureus, Streptococcus 
pyogenes, Bacillus anthracis, Clostridium tetani 9 Clostridium bolulinum, 

25 Vibrio cholerae, Helicobacter pylori, Salmonella typhimurium, Shigella 

dysenteriae, Bordetella pertussis, Yersinia pestis, Pseudomonas aeruginosa, 
Neisseria gonorrhoeae, Mycobacterium tuberculosis, Corynebacterium 
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diptheriae* Borrelia burdorferi, Treponema pallidum, Enterococcusfaecalis, 
Enterococcus faecium, and Streptococcus pneumoniae. 

. In yet another aspect, the invention features a method for 
determining whether a test substance inhibits the growth or metabolism of cells 
5 of a strain of E. faecalis bacteria having a disruptive mutation in a gene 
encoding a protein selected from the group consisting of Abel (SEQ ID NO: 
2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), 
Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), 
Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), AbclO (SEQ ID NO: 20), 

10 Abel 1 (SEQ ID NO: 22), Abcl2 (SEQ ID NO: 24), Abcl3 (SEQ ID NO: 26), 
Abe 14 (SEQ ID NO: 28), AbclS (SEQ ID NO: 30), Abcl6 (SEQ ID NO: 32), 
Abcl7 (SEQ ID NO: 34), Abcl8 (SEQ ED NO: 36), Abcl9 (SEQ ID NO: 38), 
Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), 
Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), 

15 Mfl (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 
(SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ 
ID NO: 64), Matel (SEQ ID NO: 66), and Smrl (SEQ ID NO: 68). The 
method includes the steps of: (a) contacting the cells with a test substance; and 
(b) determining whether the growth or metabolism of the cells is inhibited. An 

20 inhibition of growth or metabolism, relative to the growth or metabolism of 
cells of the strain of bacteria having the mutation but not contacted with the 
test substance, identifies the test substance as one having the potential use as 
an antibacterial agent. The bacterial strain can have disruptive mutations in 
genes encoding at least two of the proteins listed above. For example, the 

25 bacterial strain can have disruptive mutations in genes encoding three, four, 
five, or six or more of the proteins listed above. 
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In another aspect, the invention features a substantially pure nucleic 
acid molecule consisting essentially of E. faecalis Abel (SEQ ID NO: 1), Abc2 
(SEQ ID NO: 3), Abc3 (SEQ ID NO: 5), Abc4 (SEQ ID NO: 7), Abc5 (SEQ ID 
NO: 9), Abc6 (SEQ ED NO: 1 1), Abc7 (SEQ ID NO: 13), Abc8 (SEQ ID NO: 
5 15), Abc9 (SEQ ID NO: 17), AbclO (SEQ ID NO: 19), Abcll (SEQ ID NO: 
21), Abcl2 (SEQ ID NO: 23), Abcl3 (SEQ ID NO: 25), Abcl4 (SEQ ID NO: 
27), Abcl5 (SEQ ID NO: 29), Abcl6 (SEQ ID NO: 31), Abcl7 (SEQ ED NO: 
33), Abcl8 (SEQ ED NO: 35), Abcl9 (SEQ ED NO: 37), Abc20 (SEQ ED NO: 
39), Abc21 (SEQ ED NO: 41), Abc22 (SEQ ID NO: 43), Abc23 (SEQ ED NO: 

10 45), Bmr (SEQ ED NO: 47), NorA (SEQ ID NO: 49), Mfl (SEQ ID NO: 51), 
Mf2 (SEQ ED NO: 53), Mf3 (SEQ ID NO: 55), Mf4 (SEQ ED NO: 57), Mf5 
(SEQ ID NO: 59), Mf5 (SEQ ID NO: 61), Mf7 (SEQ ED NO: 63), Motel (SEQ 
ED NO: 65), or Smrl (SEQ ED NO: 67). 

En another aspect, the invention features a substantially pure 

15 polypeptide that includes a sequence selected from the group consisting of E. 
faecalis Abel (SEQ ED NO: 2), Abc2 (SEQ ED NO: 4), Abc3 (SEQ ED NO: 6), 
Abc4 (SEQ ED NO: 8), Abc5 (SEQ ED NO: 10), Abc6 (SEQ ED NO: 12), Abc7 
(SEQ ED NO: 14), Abc8 (SEQ ED NO: 16), Abc9 (SEQ ED NO: 18), AbclO 
(SEQ ED NO: 20), Abcll (SEQ ED NO: 22), Abcl2 (SEQ ED NO: 24), Abcl3 

20 (SEQ ED NO: 26), Abcl4 (SEQ ED NO: 28), Abcl5 (SEQ ED NO: 30), Abcl6 
(SEQ ED NO: 32), Abcl7 (SEQ ED NO: 34), Abcl8 (SEQ ED NO: 36), Abcl9 
(SEQ ED NO: 38), Abc20 (SEQ ED NO: 40), Abc21 (SEQ ED NO: 42), Abc22 
(SEQ ED NO: 44), Abc23 (SEQ ED NO: 46), Bmr (SEQ ED NO: 48), NorA 
(SEQ ED NO: 50), Mfl (SEQ ED NO: 52), Mf2 (SEQ ED NO: 54), Mf3 (SEQ 

25 ED NO: 56), Mf4 (SEQ ED NO: 58), Mf5 (SEQ ED NO: 60), Mf6 (SEQ ED 
NO: 62), Mf7 (SEQ ED NO: 64), Matel (SEQ ED NO: 66), and Smrl (SEQ ED 
NO: 68). 



-6- 



WO 01/79257 



PCT/US01/12230 



In still another aspect, the invention features a strain of E. faecalis 
bacteria having a disruptive mutation in a gene encoding a protein selected 
from the group consisting of Abel (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), 
Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 
5 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 
(SEQ ID NO: 18), AbclO (SEQ ID NO: 20), Abcll (SEQ ID NO: 22), Abe 12 
(SEQ ID NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), Abcl5 
(SEQ ID NO: 30), Abcl6 (SEQ ID NO: 32), Abcl7 (SEQ ID NO: 34), Abcl8 
(SEQ ID NO: 36), Abcl9 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 

10 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr 
(SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mfl (SEQ ID NO: 52), Mf2 (SEQ 
ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID 
NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Matel (SEQ ID NO: 
66), and Smrl (SEQ ID NO: 68). The bacterial strain can have disruptive 

15 mutations in genes encoding at least two of the proteins listed above. More 
preferably, the bacterial strain can have disruptive mutations in genes encoding 
three, four, five, or six or more of the proteins listed above. 

The invention also features a method for determining whether a test 
substance includes a compound that blocks efflux, of an antibacterial agent 

20 from a cell. The method includes the steps of: (a) providing a polypeptide 
selected from the group consisting of E. faecalis Abel (SEQ ID NO: 2), Abc2 
(SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ 
ED NO: 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ED NO: 14), Abc8 (SEQ ED 
NO: 16), Abc9 (SEQ ED NO: 18), AbclO (SEQ ED NO: 20), Abcll (SEQ ED 

25 NO: 22), Abcl2 (SEQ ED NO: 24), Abcl3 (SEQ ED NO: 26), Abcl4 (SEQ ED 
NO: 28), Abcl5 (SEQ ED NO: 30), Abcl6 (SEQ ED NO: 32), Abcl7 (SEQ ED 
NO: 34), Abcl8 (SEQ ED NO: 36), Abcl9 (SEQ ED NO: 38), Abc20 (SEQ ED 
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NO: 40), Abc21 (SEQ ED NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ED 
NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ED NO: 50), Mfl (SEQ ID NO: 
52), MO (SEQ ID NO: 54), MB (SEQ ED NO: 56), Mf4 (SEQ ED NO: 58), 
Mf5 (SEQ ED NO: 60), Mf6 (SEQ ED NO: 62), Mf7 (SEQ ID NO: 64), Matel 
5 (SEQ ED NO: 66), and Smrl (SEQ ED NO: 68); (b) contacting the polypeptide 
with the test substance; and (c) determining whether a compound in the test 
substance binds to the polypeptide, wherein binding of the compound to the 
polypeptide identifies it as a compound that blocks efflux of an antibacterial 
agent from a cell. The method may be performed in a cell (e.g., an E. faecalis 
10 cell, or a heterologous cell) or, alternatively, may be performed in a cell-free 
system. 

In another aspect, the invention features a method for detennining 
whether a test substance decreases expression of an MDR efflux pump selected 
from the group consisting of E. faecalis Abel (SEQ ED NO: 2), Abc2 (SEQ ID 

15 NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ED NO: 
10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), 
Abc9 (SEQ ID NO: 18), AbclO (SEQ ID NO: 20), Abcll (SEQ ED NO: 22), 
Abcl2 (SEQ ED NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), 
Abcl5 (SEQ ID NO: 30), Abcl6 (SEQ ED NO: 32), Abcl7 (SEQ ID NO: 34), 

20 Abcl8 (SEQ ED NO: 36), Abcl9 (SEQ ED NO: 38), Abc20 (SEQ ED NO: 40), 
Abc21 (SEQ ID NO: 42), Abc22 (SEQ ED NO: 44), Abc23 (SEQ ED NO: 46), 
Bmr (SEQ ED NO: 48), NorA (SEQ ID NO: 50), Mfl (SEQ ED NO: 52), Mf2 
(SEQ ID NO: 54), Mf3 (SEQ ED NO: 56), Mf4 (SEQ ED NO: 58), Mf5 (SEQ 
ED NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Matel (SEQ ED 

25 NO: 66), and Smrl (SEQ ED NO: 68). The method includes the steps of: (a) 
providing a cell expressing the MDR efflux pump; (b) contacting the cell with 
the test substance; and (c) measuring expression of the MDR efflux pump in 
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the cell, wherein decreased polypeptide expression, relative to a cell not 
contacted with the test substance, indicates that the test substance decreases 
expression of an MDR efflux pump. Expression of the MDR efflux pump can 
be determined by measuring protein levels of the MDR efflux pump or by 
5 measuring levels of RNA encoding the MDR efflux pump. The MDR efflux 
pump can be expressed in any cell (e.g., a Bacillus (B.) subtilis or Lactococcus 
(L.) lactis cell). The foregoing method may also be performed in a cell-free 
system that supports transcription (e.g., whole-cell lysates). 

The invention also features a method for identifying an inhibitor of 

10 an MDR pump that includes the steps of: (a) expressing an MDR efflux pump 
derived from a first bacterial strain in a second bacterial strain to provide 
increased resistance to an antibacterial agent relative to the second bacterial 
strain not expressing the MDR efflux pump; (b) contacting the second bacterial 
strain expressing the MDR efflux pump with an amount of the antibacterial 

15 agent that inhibits growth of a control strain (i.e., the second bacterial strain 
not expressing the MDR efflux pump) but does not substantially inhibit growth 
of the second bacterial strain expressing the MDR efflux pump; (c) contacting 
the second bacterial strain expressing the MDR efflux pump with a test 
substance; and (d) measuring growth of second bacterial strain expressing the 

20 MDR efflux pump, wherein decreased growth of the bacterial strain identifies 
the test substance as one that includes an inhibitor of an MDR efflux pump. 
Step (c) can be performed before step (b), after step (b), or simultaneous with 
step (b). 

The second bacterial strain can be, for example, a strain of B. subtilis 
25 or L lactis. The MDR efflux pump can be selected from the group consisting 
of E.faecalis Abel (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID 
NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 
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12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), 
AbclO (SEQ ID NO: 20), Abcll (SEQ ID NO: 22), Abcl2 (SEQ ID NO: 24), 
Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), Abcl5 (SEQ ID NO: 30), 
Abcl6 (SEQ ID NO: 32), Abcl7 (SEQ ID NO: 34), Abcl8 (SEQ ID NO: 36), 
5 Abcl9 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), 
Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), 
NorA (SEQ ID NO: 50), Mfl (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 
(SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ 
ID NO: 62), Mf7 (SEQ ID NO: 64), Matel (SEQ ID NO: 66), and Smrl (SEQ 

10 ID NO: 68). 

The invention also features a method for increasing the sensitivity of 
a bacterial cell to an antibacterial agent, the method including the step of 
contacting the cell with a compound that blocks efflux of the antibacterial 
agent from the cell by binding to an MDR efflux pump selected from the group 

15 consisting of E. faecalis Abel (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 
(SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ 
ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID 
NO: 18), AbclO (SEQ ID NO: 20), Abcll (SEQ ID NO: 22), Abcl2 (SEQ ID 
NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), Abcl5 (SEQ ID 

20 NO: 30), Abcl6 (SEQ ID NO: 32), Abcl7 (SEQ ID NO: 34), Abcl8 (SEQ ID 
NO: 36), Abe 19 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID 
NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID 
NO: 48), NorA (SEQ ID NO: 50), Mfl (SEQ ID NO: 52), Mf2 (SEQ ID NO: 
54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), 

25 Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Matel (SEQ ID NO: 66), and 
Smrl (SEQ ID NO: 68). The cell can be an K faecalis bacterial cell, but can 
also be, for example, any bacterial cell described herein. 
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In a related aspect, the invention features a method for increasing the 
sensitivity of a bacterial cell to an antibacterial agent, the method including the 
step of contacting the cell with a compound that blocks efflux of the 
antibacterial agent from the cell by decreasing the expression of a gene 
5 encoding an MDR efflux pump selected from the group consisting of E. 

faeccdis Abel (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), 
Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7 
(SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), AbclO 
(SEQ ID NO: 20), Abcll (SEQ ID NO: 22), Abcl2 (SEQ ID NO: 24), Abcl3 

10 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), Abcl5 (SEQ ID NO: 30), Abcl6 
(SEQ ID NO: 32), Abcl7 (SEQ ID NO: 34), Abcl8 (SEQ ID NO: 36), Abcl9 
(SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 
(SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), NorA 
(SEQ ID NO: 50), Mfl (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ 

15 ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID 
NO: 62), Mf7 (SEQ ID NO: 64), Matel (SEQ ID NO: 66), and Smrl (SEQ ID 
NO: 68). The cell can be an E. faeccdis bacterial cell, but can also be, for 
example, any bacterial cell described herein. 

In still another aspect, the invention features a method for 

20 determining whether a test substance includes a compound that reduces efflux 
of antibacterial agents from a cell. The method includes the steps of: (a) 
providing a non-bacterial cell expressing a nucleic acid molecule encoding an 
MDR efflux pump selected from E. faeccdis Abel (SEQ ID NO: 2), Abc2 
(SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ 

25 ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID 
NO: 16), Abc9 (SEQ ID NO: 18), AbclO (SEQ ID NO: 20), Abcll (SEQ ID 
NO: 22), Abcl2 (SEQ ID NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID 
NO: 28), Abcl5 (SEQ ID NO: 30), Abcl6 (SEQ ID NO: 32), Abcl7 (SEQ ID 
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NO: 34), Abcl8 (SEQ ID NO: 36), Abcl9 (SEQ ID NO: 38), Abc20 (SEQ ID 
NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID 
NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf 1 (SEQ ID NO: 
52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), 
5 Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Matel 
(SEQ ID NO: 66), and Smrl (SEQ ID NO: 68); (b) contacting the cell with the 
test substance; (c) contacting the cell with a compound that is capable of (i) 
entering the cell and (ii) being transported by the MDR efflux pump; and (d) 
measuring efflux of the compound from the cell, wherein decreased efflux, 
10 relative to a cell not contacted with the test substance, identifies the test 

substance as containing a compound that reduces efflux of antibacterial agents 
from a cell. The cell can be, for example, a eukaryotic cell, such as a 
mammalian cell, an insect cell, or a yeast cell (a Saccharomyces cerevisiae 
cell). 

15 The invention also features a method for determining whether a test 

substance includes a compound that reduces efflux of antibacterial agents from 
a cell. This method includes the steps of: (a) providing a cell free system 
consisting of: (i) a lipid membrane into which is inserted an MDR efflux pump 
selected from Enterococcus faecalis Abel (SEQ ID NO: 2), Abc2 (SEQ ID 

20 NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 
10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), 
Abc9 (SEQ ID NO: 18), AbclO (SEQ ID NO: 20), Abcll (SEQ ID NO: 22), 
Abcl2 (SEQ ED NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), 
Abcl5 (SEQ ID NO: 30), Abcl6 (SEQ ID NO: 32), Abcl7 (SEQ ID NO: 34), 

25 Abcl8 (SEQ ID NO: 36), Abcl9 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), 
Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), 
Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mfl (SEQ ID NO: 52), Mf2 
(SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ 
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ID NO: 60), Mf6 (SEQ ID NO: 62), MF7 (SEQ ID NO: 64), Matel (SEQ ID 
NO: 66), and Smrl (SEQ ID NO: 68); (ii) the test substance; and (iii) a 
compound that is capable of being transported by the MDR efflux pump; and 
(b) measuring transport of the compound across the lipid membrane, wherein 
5 decreased transport, relative to transport in a cell-free system not comprising 
the test substance, identifies the test substance as containing a compound that 
reduces efflux of antibacterial agents from a cell. In one example, the lipid, 
membrane is in the form of a liposome. 

By "MDR efflux pump" is meant any cell transporter that has been 

10 functionally shown to actively transport a drug or chemical out of a cell. 

By "substantially identical" is meant that 30 or more consecutive 
nucleotides exhibit at least 50%, preferably 85%, more preferably 90%, and 
most preferably 95% identity to a reference nucleotide sequence. For 
homologous recombination, the length of substantial identity will generally be 

15 at least 30 nucleotides, preferably at least 50 nucleotides, more preferably at 
least 100 nucleotides, and most preferably 200 nucleotides. 

By "high identity" is meant that two sequences share at least 10% of 
nucleotides or 20% of amino acids, when optimally aligned, such as by the 
program BLAST, over a comparison window of 90 nucleotides or 30 amino 

20 acids. Preferably, the percent identity is at least 25%, 30%, 40%, or even 50% 
and more preferably, the identity is at least 70% or even 80%. One sequence 
may include additions or deletions (i.e., gaps) of 20% or less when compared 
to the second sequence. Optimal alignment of sequences may be conducted, 
for example, by the methods of Gish and States (Nature Genet. 3:266-272, 

25 1993), Altshul et al. (J. MoL Biol. 215:403-410, 1990), Madden et al. (Meth. 
Enzymol. 266:131-141, 1996), Althsul et al (Nucleic Acids Res. 25:3389- 
3402, 1997), or Zhang et al (Genome Res. 7:649-656, 1997). 
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By "disruptive mutation'* is meant an alteration in the nucleic acid 
molecule that results in decreased expression or function of the encoded 
protein. Preferably, the decrease is at least 50%, more preferably it is at least 
80%, and most preferably the decrease is >99%. The mutation can be a point 
5 mutation but preferably is an insertional mutation or a deletion mutation. 
Mutations in the coding region and in the region regulating expression (e.g., 
the promoter) are each likely to be disruptive. 

By 'test substance" is meant a compound (or collection of 
compounds) being tested for its ability to inhibit growth or metabolism of a 

10 bacterial strain. Specifically excluded are compounds that were previously 
known in the public domain to inhibit the growth or metabolism of that species 
of bacteria. For example, erythromycin would not be considered to be a test 
substance when administered to a culture of E. faecalis because it was known 
that erythromycin inhibited growth or metabolism of that bacterium. 

15 Any test substance can be used in the present screening methods, 

including naturally occurring substances and non-naturally occurring 
substances. Exemplary test substances are low molecular weight substances 
produced by living organisms or other substances, soluble in the growth 
medium, having a molecular weight between 150 and 750 daltons. Test 

20 substances can be individual compounds or libraries of compounds. One 
skilled in the art will recognize that there are numerous sources for the test 
substance, and any of these is suitable for use in the present screening method. 

Determining whether the growth or metabolism of a cell is inhibited 
can be performed using any of a number of standard assays, such as those 

25 described herein. Cellular metabolism can be measured, for example, by 
assessing the activity of one or more enzymes that function in cellular 
metabolic pathways. Alternatively, cellular metabolism can be measured by 
any of the surrogate markers that are known to cell biologists. Methods for 
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determining growth or metabolism include physical observation (e.g., MIC 
determination) and measuring turbidity (e.g., absorbance at 650 nm; IC^ 
determination). 

The method for determining growth or metabolism can be adapted 
5 for high throughput screening, for example, by culturing the cells in multi-well 
or microliter plates and measuring growth or metabolism 
spectrophotometrically. 

In the absence of any test substance, a bacterial strain having a 
mutation may exhibit a different rate of growth or metabolism than that 

10 exhibited by the parental strain not having the mutation. Accordingly, it is 
desirable to determine whether a test substance inhibits the growth or 
metabolism of a cell by comparing the growth or metabolism of a bacterial 
strain in the presence of the test substance with the growth or metabolism of 
that strain in the absence of the test substance. 

15 A test substance is considered to inhibit the growth or metabolism of 

a bacterium if it decreases growth or metabolism by at least 10% when 
administered at a final concentration of up to 100 fig/ml to a culture of that 
bacterium, as determined using one of the foregoing assays. Preferably, 
growth or metabolism is decreased by at least 25%, more preferably by at least 

20 50%, and most preferably by at least 75%. 

The invention provides methods and reagents for (i) identifying 
MDR efflux pumps (ii) knocking out the function of one or more genes 
encoding MDR efflux pumps in E. faecalis, (iii) using these mutated bacterial 
strains to identify novel antibacterial agents, and (iv) using the novel genes 

25 encoding MDR efflux pumps to identify compounds that increase the 

sensitivity of a bacterial cell to an antibacterial agent. The gene disruption 
techniques are broadly applicable for knocking out genes in a wide variety of 
bacteria, including, for example, Staphylococcus aureus, Streptococcus 
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pyogenes, Bacillus anthracis, Clostridium tetania Clostridium bolulinum, 
Vibrio cholerae, Helicobacter pylori, Salmonella typhimurium, Shigella 
dysenteriae, Bordetella pertussis, Yersinia pestis, Pseudomonas aeruginosa, 
Neisseria gonorrhoeae, Mycobacterium tuberculosis, Corynebacterium 
5 diptheriae, Borrelia burdorferi, Treponema pallidum, Enterococcus faecalis, 
Enterococcus faecium, and Streptococcus pneumoniae. 

Other features and advantages of the invention will be apparent from 
the following description of the preferred embodiments thereof, and from the 
claims. 

10 Brief Description of the Drawings 

Fig. 1 is a schematic illustration showing single gene knockout 
generation by gene disruption. Bacterial cells in which the target gene has 
been disrupted are isolated by selection in the presence of kanamycin. This 
method cannot be used for the generation of multigene knockout strains 
15 because once the kanamycin-bearing plasmid is inserted, a second kanamycin- 
bearing plasmid can no longer be used for selection of another gene disruption. 
Additionally, the plasmid backbone integrated into the genome provides an 
area for non-targeted homologous recombination. 

Figs. 2A-2F are a series of schematic illustrations showing JS. 
20 faecalis MDR-knockout strains are more sensitive to antibacterial agents. 

Fig. 3 is a schematic illustration showing gene knockout by 
disruption, followed by marker and gene target deletion. Sequences A and B, 
which flank the target gene, are amplified by PCR and cloned into a vector that 
provides kanamycin resistance. The bacteria are transformed with this vector 
25 and selected for kanamycin-resistant bacteria (thus selecting for bacteria in 
which the vector has integrated into the chromosomal DNA due to a first 
crossover). Subsequentiy, the cells are grown in the absence of kanamycin. 
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During this phase, if a second crossover occurs, the target gene and the 
kanamycin resistance gene are each lost Thus, the desired bacteria will die in 
the presence of kanamycin. These kanamycin-sensitive cells can be identified 
by replicate plating, as described herein. 
5 Fig. 4 is a schematic illustration showing E. faecalis Abel nucleotide 

sequence (SEQ ID NO: 1) and Abel (SEQ ID NO: 2) amino acid sequence. 

Fig. 5 is a schematic illustration showing K faecalis Abc2 nucleotide 
sequence (SEQ ID NO: 3) and Abc2 amino acid sequence (SEQ ID NO: 4). 

Fig. 6 is a schematic illustration showing E. faecalis Abc3 nucleotide 
10 sequence (SEQ ID NO: 5) and Abc3 amino acid sequence (SEQ ID NO: 6), 

Fig. 7 is a schematic illustration showing E. faecalis Abc4 nucleotide 
sequence (SEQ ID NO: 7) and Abc4 amino acid sequence (SEQ ID NO: 8). 

Fig. 8 is a schematic illustration showing E. faecalis Abc5 nucleotide 
sequence (SEQ ID NO: 9) and Abc5 amino acid sequence (SEQ ID NO: 10). 
15 Fig. 9 is a schematic illustration showing E. faecalis Abc6 nucleotide 

sequence (SEQ ID NO: 1 1) and Abc6 amino acid sequence (SEQ ID NO: 12). 

Fig. 10 is a schematic illustration showing E. faecalis Abc7 
nucleotide sequence (SEQ ID NO: 13) and Abc7 amino acid sequence (SEQ 
ID NO: 14), 

20 Fig. 1 1 is a schematic illustration showing E. faecalis Abc8 

nucleotide sequence (SEQ ID NO: 15) and Abc8 amino acid sequence (SEQ 
ID NO: 16), 

Fig. 12 is a schematic illustration showing R faecalis Abc9 
nucleotide sequence (SEQ ID NO: 17) and Abc9 amino acid sequence (SEQ 
25 ID NO: 18), 

Fig. 13 is a schematic illustration showing E. faecalis AbclO 
nucleotide sequence (SEQ ID NO: 19) and AbclO amino acid sequence (SEQ 
ID NO: 20), 
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Fig. 14 is a schematic illustration showing E.faecalisAbcll 
nucleotide sequence (SEQ ID NO: 21) and Abcll amino acid sequence (SEQ 
ID NO: 22), 

Fig. 15 is a schematic illustration showing E. faecalis Abcll 
5 nucleotide sequence (SEQ ID NO: 23) and Abcl2 amino acid sequence (SEQ 
ID NO: 24), 

Fig. 16 is a schematic illustration showing E. faecalis Abe 13 
nucleotide sequence (SEQ ID NO: 25) and Abcl3 amino acid sequence (SEQ 
ID NO: 26), 

10 Fig. 17 is a schematic illustration showing E. faecalis Abcl4 

nucleotide sequence (SEQ ID NO: 27) and Abcl4 amino acid sequence (SEQ 
ID NO: 28), 

Fig. 18 is a schematic illustration showing E. faecalis Abel 5 
nucleotide sequence (SEQ ID NO: 29) and Abcl5 amino acid sequence (SEQ 
15 ID NO: 30), 

Fig. 19 is a schematic illustration showing E. faecalis Abel 6 
nucleotide sequence (SEQ ID NO: 31) and Abcl6 amino acid sequence (SEQ 
ID NO: 32), 

Fig. 20 is a schematic illustration showing E. faecalis Abel 7 
20 nucleotide sequence (SEQ ID NO: 33) and Abcl7 amino acid sequence (SEQ 
ID NO: 34), 

Fig. 21 is a schematic illustration showing E. faecalis Abcl8 
nucleotide sequence (SEQ ID NO: 35) and Abe 18 amino acid sequence (SEQ 
ID NO: 36), 

25 Fig. 22 is a schematic illustration showing E. faecalis Abcl9 

nucleotide sequence (SEQ ID NO: 37) and Abcl9 amino acid sequence (SEQ 
ID NO: 38), 
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Fig. 23 is a schematic illustration showing E.faecalis Abc20 
nucleotide sequence (SEQ ID NO: 39) and Abc20 amino acid sequence (SEQ 
ID NO: 40), 

Fig. 24 is a schematic illustration showing E.faecalis Abc21 
5 nucleotide sequence (SEQ ID NO: 41) and Abc21 amino acid sequence (SEQ 
ID NO: 42), 

Fig. 25 is a schematic illustration showing E. faecalis Abc22 
nucleotide sequence (SEQ ID NO: 43) and Abc22 amino acid sequence (SEQ 
ID NO: 44), 

10 Fig. 26 is a schematic illustration showing E. faecalis Abc23 

nucleotide sequence (SEQ ID NO: 45) and Abc23 amino acid sequence (SEQ 
ID NO: 46), 

Fig. 27 is a schematic illustration showing E.faecalis Bmr 
nucleotide sequence (SEQ ID NO: 47) and Bmr amino acid sequence (SEQ ID 
15 NO: 48), 

Fig. 28 is a schematic illustration showing E faecalis NorA 
nucleotide sequence (SEQ ID NO: 49) and NorA amino acid sequence (SEQ 
ID NO: 50), 

Fig. 29 is a schematic illustration showing E faecalis Mfl 
20 nucleotide sequence (SEQ ED NO: 51) and Mfl amino acid sequence (SEQ ID 
NO: 52), 

Fig. 30 is a schematic illustration showing E.faecalis Mf2 
nucleotide sequence (SEQ ID NO: 53) and Mf2 amino acid sequence (SEQ ID 
NO: 54), 

25 Fig. 3 1 is a schematic illustration showing E. faecalis Mf3 

nucleotide sequence (SEQ ID NO: 55) and Mf3 amino acid sequence (SEQ ID 
NO: 56), 
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Fig. 32 is a schematic illustration showing E. faecalis Mf4 
nucleotide sequence (SEQ ID NO: 57) and Mf4 amino acid sequence (SEQ ED 
NO: 58), 

Fig. 33 is a schematic illustration showing E. faecalis Mf5 
5 nucleotide sequence (SEQ ID NO: 59) and Mf5 amino acid sequence (SEQ ID 
NO: 60), 

Fig. 34 is a schematic illustration showing E. faecalis Mf6 
nucleotide sequence (SEQ ID NO: 61) and Mf6 amino acid sequence (SEQ ID 
NO: 62), 

10 Fig. 35 is a schematic illustration showing E. faecalis Mf7 

nucleotide sequence (SEQ ID NO: 63) and Mf7 amino acid sequence (SEQ ID 
NO: 64), 

Fig. 36 is a schematic illustration showing E. faecalis Matel 
nucleotide sequence (SEQ ID NO: 65) and Matel amino acid sequence (SEQ 
15 ID NO: 66), 

Fig. 37 is a schematic illustration showing E. faecalis Smrl 
nucleotide sequence (SEQ ID NO: 67) and Smrl amino acid sequence (SEQ 
ID NO: 68). 

Detailed Description 

20 We have invented a method for identifying MDR efflux pumps by 

first mining bacterial genome databases to identify candidate pumps, and then 
validating function using a disruption-mediated gene knockout strategy. Using 
this method, we have discovered over thirty candidate MDR efflux pumps in 
E. faecalis. Gene knockout experiments to date have verified that at least five 

25 of these efflux pumps do function as MDR efflux pumps. 
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We have also discovered methods for generating bacterial strains in 
which two or more MDR efflux pumps have been mutated by gene deletion. 
These bacterial strains are useful for screening test substances for their ability 
to inhibit the growth or metabolism of bacterial cells. 

5 Example 1: Genomic Sequence Database Mining to Identify Candidate 
Drug-Efflux Pumps in E. faecalis 

We used the DNA sequence and membrane topology of several 
known MDR efflux pumps to probe, using a series of BLAST searches, for 
homologies within the genomic sequence of E. faecalis (Worley et al., 

10 Genome Res. 5:173-184, 1995; see Table 1; the database can be accessed at 
http://www.tigr t org). The amino acid sequence from a known MDR efflux 
pump from other bacterial and fungal species were used as probes in BLAST 
searches of the TIGR database (WU-BLAST version 2.0). A list of contigs 
and their P-values was obtained. P-values under the cut-off of le-20 were 

15 considered for further analysis. The ORFs from each contig were located and 
examined for the presence of predicted transmembrane coding regions using 
Sosui version 1.0/10 (Hirokawa et al., Bioinformatics 14:378-379, 1998). For 
the ABC family, ABC signature sequences and ATP/GTP binding site motif A 
(P-loop) using Motif finder (http://www.motif.genome.ad.jp/). ORFs that 

20 satisfied the foregoing criteria were then used as probes in BLAST searches 
against protein databases in order to identify ORFs having orthologues that are 
known MDR efflux pumps (it is not necessary that an ORF have an orthologue 
that is a known MDR efflux pump; this step aids in prioritizing candidate 
MDR efflux pumps for testing). 
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Table 1 



Organism 


MDRPump Gene 


Transporter Family 


Accession Number 


S. aureus 


NorA 


MF 


P21191 


B. subtUis 


Bmr 


MF 


P33449 


B. subtUis 


Bit 


MF 


L32599 


S. pneumoniae . 


PmrA 


MF 


AJ07367 


L lactis 


LmrP 


MF 


X89779 


L lactis 


LnvrA 


ABC 


U63741 


S. epidermidis 


MsrA 


ABC 


X52085 



10 The ABC Family 

The members of the ABC family of proteins share an ATP binding 
cassette sequence motif that enables the identification of E. faecalis sequences 
having high identity on the amino acid level to L lactis LmrA and S. 
epidermidis Msx A (designated Abel through Abc23, respectively; Table 2). 

15 Only a minority of ABC proteins are predicted to act as drug transporters, and, 
at present, there is no easy way to distinguish MDR-ABC proteins from those 
that perform any of the myriad of non-MDR related functions within the cell. 
Thus, a second step of the method includes functional testing of the candidate 
encoded polypeptides by generating disruption or deletion mutants (see 

20 Examples 2 and 4). 

The Multiple Facilitator Superfamily 

We scanned the E. faecalis genomic database using as probes a set 

of known MDR genes of the MF superfamily from a set of Gram-positive 

bacteria (e.g. NorA, Bmr, Bit, PmrA; Table 1). We identified a set of 

25 sequences having high identity to one or more of the probes; these sequences 

were designated Mf 1 through Mf7 (Table 2). The E. faecalis genes designated 

NorA and Bmr were labeled such because of the high sequence identity to the 
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equivalent genes in S. aureus and B. subtilis, respectively (Table 1). Other MF 
genes in E. faecalis are also highly similar to these genes. As with the ABC 
homologues, the MF family of proteins most likely have a broad range of 
functions, many of which are unrelated to MDR. Therefore, functional testing 
5 by drug sensitivity is performed to definitively designate these proteins as 
MDR efflux pumps (see below). 

MATE and Smr Families 

A family of transporters termed MATE (multidrug and toxic 
compound extrusion transporters) has been described based on sequence 

10 homology to the NorM multidrug efflux protein of V. parahaemolyticus and 
the YdhE gene product of E. coli and Haemophilus (H.) influenzae (Morita et 
al., Antimicrob. Agents Chemother. 42:1778-1782, 1998; Brown et al., Mol. 
Microbiol. 31:394-395, 1999). The Smr family consists of proteins with four 
transmembrane-spanning regions, and the MDR efflux proteins within this 

15 family are the smallest known secondary transporters (Paulsen et al., Mol. 
Microbiol. 19: 1167-1 175, 1996). Using the E. faecalis genomic database, we 
scanned for potential members of the MATE transporter protein family with B. 
subtilis Ypn, YisQ, and YucE, and H. influenza YdhE, and for potential 
members of the Smr transporter protein family with B. subtilis EbrA and EbrB. 

20 By conducting orthologue searching of MDR efflux pump gene families, 
we identified candidates for gene-knockout generation and drug sensitivity 
analysis in E. faecalis (Table 2). 
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Table 2 



E.faecalis 


Homologue 


Organism 


Accession # 


E value" 


sequence identifier* 










Abel (SEQ ID NO: 2) . 


Tap-tub2 


M. tuberculosis 


Q11047 


9.20E-114 


(Contig6453) 


YfiC 


B. subtilis 


P54719 


4.80E-110 




£11 Ul 


c. con 


* OKif JX 


iiJUE-lv / 




Tap-tub 


M. tuberculosis 


Q11046 


3.50E-97 




AtrD 


A. nidulans 


AF071411 


1.80E^90 




afuMDRl 


A fumigatus 


U62934 


3.30E-90 


Abc2 (SEQ ID NO: 4) 


Tap-tub2 


M. tuberculosis 


Q11047 


4.70E-123 


(Contig6216) 


mdl 


E. coli 


P30751 


5.30E-117 




YfiC 


B. subtilis 


P54719 


6.90E-114 




AtrD 


A. nidulans 


AF071411 


&90E-106 




YfiB 


B. subtilis 


P54718 


9.30E-103 


Abc3 (SEQ ID NO: 6) 


Yhel 


B. subtilis 


CAB12811 


8.40E-106 


(Contig 6222) 


ComA 


S. pneumoniae 


M36180 


6.30E-57 




afuMDRl 


A. fumigatus 


U62934 


1.50E-56 


Abc4 (SEQ ID NO: 8) 


Tap-tub2 


M. tuberculosis 


Q11047 


L70E-143 




inu 


i>. SIWIUIS 




o qc\c in 
z.oUc-n / 


Abc5(SEQIDNO: 10) 


Tap-tub 


M. tuberculosis 


Q11046 


1.80E-115 


(Contig 6207) 


mdl 


E. coli 


P30751 


1.40E-104 




AtrD 


A. nidulans 


AF071411 


4.50E-103 




YfiB 


B. subtilis 


P54718 


1.70E-101 




afuMDRl 


A ftimiQfihAX 


U62934 


4 90E-96 




LmrA 


Z*. lactis 


U63741 


5 20E-73 




ComA 


S. pneumoniae 


M36180 


8 40E-58 




nfrnrfrl 


P falrinamm 


M29154 


2 80E-56 < 


Abc6(SEQIDNO: 12) 


LmrA 


JL lactis 


U63741 


6.00E-75 


(Contig 6344) 


ComA 


S. pneumoniae 


M36180 


8.00E-63 




afuMDRl i 


A fumipotus 


U62934 


5 60E-56 


Abc7(SEQIDNO:14) 


AtrD 


A. nidulans 


AF071411 


5.30E-106 


(Contig 6226) 


YfiC 


B. subtilis 


P54719 


6.90E-105 




afuMDRl 


A fumigatus 


U62934 


4.70E-103 




YfiB 


B. subtilis 


P54718 


6.80E-100 




LmrA 


L lactis 


U63741 


1.70E-71 




ComA 


S. pneumoniae 


M36180 


1.40E-57 


Abc8(SEQIDNO: 16) 


mdl 


E. coli 


P30751 


4.50E-134 


(Contig 6358) 


AtrD 


A nidulans 


AF071411 


8.20E-115 




YfiC 


B. subtilis 


P54719 


5.80E-108 




afuMDRl 


A. fumigatus 


U62934 


1.20E-102 




Yhel 


B. subtilis 


CAB12811 


4.90E-90 | 




LmrA 


L lactis 


U63741 


. 3.10E-71 


Abc9(SEQIDNO: 18) 


SunT 


B. subtilis 


CAB 14065 


7.70E-58 


(Contig 6217) 
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E.faecalis 
sequence identifier* 


Homologue 


Organism 


Accession # 


E value* 




AbclO (SEQ ID NO: 20) 
(Contig6328) 


cydC 


rw • _a . _ 

H. influenza 


AAC228H 


2.40E-51 




Abcll(SEQIDNO:22) 
(Conng 61 03) 


Tap-tub2 
mdl 
AtrD 
Tap-tub 
afuMDRl 


M. tuberculosis 
L. coll 

A. nidulans 
M. tuberculosis 

A fianigatus 


QH047 
F30751 
AF071411 
Q11046 
U62934 


5.60E-109 

i.30fi-yv 

6.10E-97 

2.30E-87 

1.60E-85 


5 

• 


Abcl2(SEQIDNO:24) 
(Contig6317) 


afuMDRl 
AtrD 
pfindrl 


A. fianigatus 
A. nidulans 
P. falciparum 


U62934 
AF071411 
M29154 


3.30E-81 
5.90E-81 
4.30E-66 




Abcl3(SEQroNO:26) 
(Contig6285) 


SunT 


B. subtUis 


CAB 14065 


6.30E-21 


10 


Abcl4(SEQIDNO: 28) 
(Contig 6864) 


mdl 


E. coli 


P30751 


7.20E-92 




Abcl5(SEQIDNO: 30) 
(Contig6183) 


StpC 
DirA 


S. aureus 
S. peucetius 


230588 
P32010 


4.60E-27 
1.00E-25 




Abcl6 (SEQ ID NO: 32) 
(Contig6229) 


MsrA 
SmrB 


S. epidennidis 
S. ambofaciens 


X52085 
X63451 


3.10E-88 ' 
8.20E-31 


15 


Abcl7(SEQEDNO:34) 
(Contig6174) 


SmrB 
TlrC 


S. ambofaciens 
S.fradiae 


X63451 
M57437 


7.10E-62 
8.00E-57 




Abcl8(SEQIDNO: 36) 
(Contig6381) 


SmrB 
TliC 


S. ambofaciens 
S.fradiae 


X63451 
M57437 


5.90E-62 
2.40E-51 


20 


Abcl9(SEQIDNO: 38) 
(Contig 6402) 


TlrC 
SmrB 


S.fradiae 
S. ambofaciens 


M57437 
X63451 


1.00E-46 
1.80E-43 




Abc20(SEQIDNO:40) 
(Contig 6203) 


DrrA 


S. peucetius 


P32010 


2.50E-35 




Abc21(SEQIDNO:42) 
(Contig 6237) 


DrrA 


S. peucetius 


P32010 


3.40E-24 


25 


Abc22(SEQIDNO:44) 
(Contig 6232) 


SmrB 
TlrC 


S. ambofaciens 
S.fradiae 


X63451 
M57437 


5.70E-43 
i 5.00E-40 




Abc23(SEQIDNO:46) 
(Contig 6274) 


MsrA 


S. epidermidis 


X52085 


3.60E-39 


30 


Bmr (SEQ ID NO: 48) 
(Contig 6187) 


PmrA 


S. pneumoniae 


AJ007367 


7.30E-104 




NorA (SEQ ID NO: 50) 
(Contig 6353) 


NorA 
Bmr 
Bit 


S. aureus 
B. subtilis 
B. subtilis 


SWP21191 
SWP33449 
EML32599 


1.00E-73 
1.60E-89 
4.60E-92 
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E.faeca!is 


Homologue 


Organism 


Accession # 


E value" 


sequence identifier* 










Mfl (SEQ ED NO: 52) 


Ptr 


5. pristinaespiralis 


X84072 


4.00E-59 1 


(Contig 6336) 


LfrA 


M. smegmatis 


U40487 


5.20E-51 


Mmr 


B. subtilis 


Q00538 


1.40E-48 




QacA 


S. aureus 


GB X56628 


1.80E-48 




SmvA 


S. typhimurium 


P37594 


9.00E-45 




EmrB 


E. coli 


BAA16553 


1.40E-39 




Bmr3 


B. subtilis 


Z99105 


2.20E-34 


Mf2 (SEQ ID NO: 54) 


Bmr3 


B. subtilis 


Z99105 


7.80E-62 


(Contig 6226) 


YfiU 


B. subtilis 


BAA24461 


1.80E-46 




EmrB 


E. coli 


BAA16553 


4.60E-43 




Mmr 


B. subtilis 


Q00538 


2.00E-42 




Ptr 


S. pristinaespiralis 


X84072 


1.10E-39 




SmvA 


S. typhimurium 


P37594 


1.40E-34 




LfrA 


M. smegmatis 


U40487 


1.40E-32 




QacA 


S. aureus 


X56628 


4.60E-30 


Mf3 (SEQ ID NO: 56) 


pmrA 


S. pneumoniae 


AJ007367 


3.30E-85 


(Contig 6443) 


NorA 


S. aureus 


P21191 


3.80E-27 




Bit 


B. subtilis 


L32599 


2.60E-22 




Bmr 


B. subtilis 


P33449 


3.10E-22 


Mf4 (SEQ ID NO: 58) 


YcnB 


B. subtilis 


BAA09056 


2.30E-129 


(Contig 6236) 


LmrB 


B. subtilis 


BAA22228 


4.10E-95 




EmrB 


E. coli 


BAA16553 


8.80E-49 




Ptr 


S. pristinaespiralis 


X84072 


1.90E-39 




Mmr 


B. subtilis 


Q0O538 


6.60E-37 




Bmr3 


B. subtilis 


Z99105 


4.90E-32 




QacA 


S. aureus 


X56628 


4.80E-30 


Mf5 (SEQ ID NO: 60) 


EmrB 


E. coli 


BAA16553 


1.40E-46 


(Contig 6480) 


Mmr 


B. subtilis 


Q00538 


2.70E-43 




Bmr3 


B. subtilis 


Z99105 


7.90E-39 




QacA 


S. aureus 


X56628 


5.10E-37 




T -fUA 

LfrA 


M. smegmatis 


U404o7 


i.yUii-31 j 




Bcr 


E. coli 


PRJN0659 


6.20E-15 


MlO (oKQ ID JNU: oZ) 


T m«>T2 

LmrB 


D. SUDtlllS 




l.yuc-ov i 


(Contig 6248) 


YcnB 


B. subtilis 


BAA09056 


4.90E-54 




Mmr 


B. subtilis 


Q0O538 


8.30E-29 




QacA 


S. aureus 


X56628 


1.80E-27 




Bmi3 


B. subtilis 


Z99105 


3.60E-26 


Mf7 (SEQ ID NO: 64) 


YdgK 


B. subtilis 


BAA09056 


4.70E-28 


(Contig 6473) 


EmrD 


E. coli 


P31442 


4.20E-27 
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E.faecalis 


Homologue 


Orcanism 


Accession # 


E value" 


sequence identifier* 










Matel(SEQIDNO:66) 


YpnP 


B. subtilis 


P54181 


1.10E-13 


(Contig6563) 


YucE 


B. subtilis 


CAB 12921 


3.90E-10 




YdhE 


H. influenza 


P45272 


9.90E-08 




YisQ 


B. subtilis 


CAA70645 


6.00E-O7 


Smrl(SEQIDNO:68) 


EbrA 


B. subtilis 


CAB 13614 


2.60E-24 


(Contig6338) 


EbrB 


B. subtilis 


CAB 13613 


3.70E-20 



5 The E- value relates to the Smallest Sum probability of the number of hits one can expect to see by 

chance when searching a database of a certain size. The analysis was performed according to Gish and 
States (Nature Genet. 3:266-272, 1993) and Worley et al. (Genome. Res. 5: 173-184, 1995). 
♦denotes contig number in TIGR database (http://www.tigr.org) 

Example 2: Development of Tools to Generate Gene-Knockout Mutants 

10 The function of newly discovered genes can be studied by 

gene-specific mutagenesis, commonly referred to as a gene-knockout, followed 
by phenotypic characterization. Gene-knockout mutants in microorganisms are 
usually generated by the following steps: (a) construction of a vector that 
contains a portion of the target gene to be inactivated, (b) introduction into the 

15 host organism of the knockout vector, (c) recombination of the knockout 

vector into the target gene via homologous recombination such that integration 
disables gene function, and (d) selection of the knockout mutant using a 
dominant drug selection marker (e.g., kanamycin). The aforementioned 
strategy has been used widely in bacteria and fungi and is well developed for 

20 use in the model Gram-negative bacterium, E. coli, and Gram-positive 

bacterium, B. subtilis, respectively (Link et al., J. Bacteriol. 179:6228-6237, 
1997; Vagner et al., Microbiology 144:3097-3104, 1999). 

Prior to the present invention, knockout mutations in E. faecalis 
were generated by conjugation and transposition (see, for example, Teng et al. 

25 Plasmid 39:182-186, 1998) or, alternatively, by insertional single gene 

knockoute (Qin et al., Antimicrob. Agents Chemother. 42:2883-2888, 1998). 
Using these methods, predominantly insertional mutations were introduced. In 
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this case, there would be no way for removing the antibiotic selection marker 
without reconstituting the target gene of interest. Thus, the disruption methods 
cannot be adapted for the generation of multigenic knockouts. 

In order to functionally identify the genes responsible for MDR 
5 efflux in K faecalis, we developed a method for disrupting genes by 

homologous recombination. To facilitate the performing of this method, we 
developed gene-disruption plasmid vectors and transformation and selection 
methods suitable for use in E. faecalis. The single-gene disruption strategy is 
illustrated in Fig. 1. Briefly, a fragment of the target gene is generated from 

10 the E. faecalis genomic DNA by PCR using primers based on the identified 
gene sequence. Generally, the optimal size of the partial-gene fragment is 
approximately 800 bp. Although each primer pair and PCR reaction required 
optimization to generate with fidelity the gene fragment, we successfully 
cloned a very significant proportion of the genes listed in Table 2, including 

15 Abel, Abc2, Abc3, Abc4, Abc5, Abc6, Abc7, Abc8, Abc8, AbclO, Abel 1, 
Abcl3, Abcl4, Abcl5, Abcl6, Abcl7, Abc20, Abc22, Abe 23, Mfl, Mf2, 
Mf3, Mf4, Mf5, Mf6, Mf7, and Matel. PCR-generated gene fragments are 
. then cloned into the BamHl site of the pBS/Kan plasmid vector and 
transformed into E. coli for selection on kanamycin. Each vector construction 

20 is validated by restriction digest mapping using standard methods. The 
knockout vectors are each individually introduced into E. faecalis by 
electroporation into the wild-type strain OGRF1 (see below for protocol). 
Since the pBS/Kan vectof is designed for replication in E. coli and does not 
contain a Gram-positive origin of replication, it fails to autonomously replicate 

25 in Gram-positive bacteria (Teng et al., Plasmid 39:182-186, 1998). Hence, E. 
faecalis transformants selected on kanamycin are presumed to have the vector 
integrated into the chromosome, as illustrated in Fig. 1. 
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We developed methods to introduce the gene-disruption vectors into 
E. faecalis to generate gene-disruption mutants. Briefly, electrocompetent 
cells were prepared from a 1 liter culture of strain OGRF1 grown in Brain 
Heart Infusion (BHI) medium as follows. Cells were grown until late log 
5 phase, recovered by centrifiigation, and the pellet washed in sterile 10% 
glycerol. After four washing steps, the cells were resuspended in 2 ml sterile 
10% glycerol in water and stored at -70°C. Electroporation was performed 
with 3 fig DNA mixed with 50 fil cells in a 0. 1 cm cuvette. An Invitrogen 
Electroporator was used with settings at 50 /zF, 200 Ohm and 1.25 KV/cm. 

10 One milliliter Todd Hewitt (TH) broth was added immediately after pulsing 
and the cells were incubated at 37°C with agitation at 225 rpm. Transfonnants 
were selected on BHI agar containing kanamycin at 2 mg/ml and observed 
after 24 hours of growth. Transformation efficiency was consistently 1-6 
transformants per 3 jig of DNA. 

15 The genomic sequence in the TIGR database is derived from strain 

V583. Since strain V583 is a vancomycin-resistant clinical isolate, we chose 
for safety reasons to use the vancomycin-sensitive strain, OGRF1, for all our 
laboratory manipulations. There are small sequence differences between the 
strains, and PCR analysis indicates that OGRF1 lacks the gene products Abc9, 

20 Abcl2 and Smrl. 

The putative MDR-knockout mutants were examined individually 
utilizing a PCR-based strategy to demonstrate that integration of the knockout 
plasmid had occurred at the targeted site via homologous recombination. The 
results of this process verified that the integration events were targeted at the 

25 appropriate site in a very high proportion of the transformants examined. We 
generated a large panel of gene knockout mutants using the techniques 
described herein. 
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Example 3: Methods for Determining Growth Inhibition 

One can readily test bacteria for growth inhibition by test substances 
(e.g., natural product extracts) using visual observation (MIC determination) or 
turbidity (absorbance at 650 nm; IC 50 determination). Growth inhibition is 
5 calculated as the percentage decrease in absorbency, compared to the positive 
control cells (e.g., the same bacterial strain in the absence of any test 
substance). We have developed a 96-well microplate-based assay that is 
suitable for drug-susceptibility testing of E. faecalis. In this assay, E. faecalis 
is inoculated into BHI broth at OD^ = 0.08 to 0.1. This stock cell suspension 

10 is diluted 1:300 into BHI medium, and 98 /il of the cell suspension is added to 
2 jLil of antibacterial agent in a microtiter plate and grown overnight at 37°C. 
When the disruption mutants are being tested, the appropriate antibiotic (2 
mg/ml kanamycin) is added into the BHI medium. Turbidity readings are 
measured at OD 650nm and used to calculate the IC 50 values. MICs are also 

15 recorded at this time. 

An example of the foregoing method now follows, in which 
single-gene disruption mutants were contacted with one of more than 40 
different antimicrobial agents from disparate chemical classes. A large 
proportion of the single-gene disruption mutants did not exhibit sensitivity in 

20 comparison with the wild-type strain, OG1RF. In 5. cerevisiae, it is known 
that several MDR pumps have overlapping sets of substrate specificities so that 
single-MDR-gene disruption strains may exhibit marginal drug sensitivity. In 
contrast, multiple-MDR-gene knockout strains exhibit markedly increased 
hypersensitivity as compared to the respective single-MDR-gene disruption 

25 (Mahe et al., J. Biol. Chem. 271:25167-25172, 1996; Heckenstein et al., Yeast 
15:133-137, 1999). We now demonstrate that multiple-MDR-gene disruption 
strains of E. faecalis also exhibit increased hypersensitivity as compared to 
single MDR-gene knockout mutants (see Example 4). Moreover, using the 

-30- 



WO 01/79257 



PCT/US01/12230 



single-gene disruption mutants, several MDR-efflux pumps were identified in 
E. faecalis along with their spectrum of drug substrates (Table 3). These MDR 
efflux pumps required functional analysis for unequivocal identification since, 
in most cases, sequence comparison analysis alone was 
5 inadequate. For example, Abc7, Abcl6, and Abc23 are not particularly distinct 
from the numerous other ABC genes (e.g., B. subtilis NatA\ Cheng et aL, Mol. 
Microbiol. 23:1107-1120, 1997), and the sequence similarity alone gives no 
indication that these transporters are MDR efflux pumps (Table 2). 

The single-gene MDR-disruption strains were tested against known 

10 antibacterial agents at multiple drug concentrations to define the MIC (Table 
3). A subset of the single-gene MDR-disruption strains that exhibited drug 
sensitivity was selected for more detailed analysis and the dose-response curve 
and ICsq were determined for each. A comparison of the growth of 
gene-disruption strains ANorA, AAbc7, AAbcl6 and AAbc23 versus the 

15 wild-type strain in the presence and absence of antibacterial agents is shown in 
Fig. 2. 
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Table 3 



Gene 


Drug 


MIC Oig/ml) 


WT/Agene ratio 


knockout 




wild-type strain 


Agene 




AAbc7 


Doxorubicin 


>100 


136 


64 




riannnniKi^in 
.L/dUlIUI UUIK^IU 


25 


3.12 


ft 
o 




liuuuiuiii JDnjiniuc 


25 


6.25 


A 




Ofloxacin 


12.5 


3.12 


4 




Erythromycin 


1.56 


0.78 


2 




Gentamicin 


50 


25 


2 




Mithramycin 


0.2 


.01 


2 




Norfloxacin 


6.25 


3.12 


2 




Novobiocin 


LZ.J 


O.ZJ 


o 

Z 




Sanguinarine 


IZ.J 


O.Aj 


2 


Mbcl6 


Erythromycin 


1.56 


0.1 


16 




Daunorubicin 


25 


12.5 


2 




Sanguinarine 


12.5 


6.25 


2 


AAbc23 


Clindamycin 

«* * ■ ■ J will 


25 


0.05 


500 




Lincomycin 


50 


0.1 


500 




Virginiamycin 


100 


1.56 


64 




Synercid 


25 


0.39 


04 


ANorA 


Acriflavin 


12.5 


6.25 


2 




Ciprofloxacin 


1.56 


0.78 


2 




Clindamycin 


50 


25 


2 




Erythromycin 


1.56 


0.78 


2 




Norfloxacin 


6.25 


3.12 


2 




Novobiocin 


12.5 


6.25 


2 


Bmr 


Clindamycin 


50 


25 


2 




Erythromycin 


1.56 


0.78 


2 



Example 4: Generation of Multiple Gene-Knockouts in a Single Strain 

10 The single-gene disruption strategy described in Example 2 

precludes the iterative use of the pBS/Kan vector to generate multiple insertion 

mutations in a single strain because of the presence of the kanamycin 

selectable marker following the first knockout. Furthermore, the re-use of the 

same plasmid backbone would likely result in homologous recombination in 

15 any subsequently introduced plasmid with shared sequence. To circumvent the 

foregoing problems, we developed an alternative strategy to develop methods 

for the generation of multiple site-specific disruption mutations in a single 

strain. We developed molecular tools to enable the facile generation of 
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site-directed multiple-knockout mutations utilizing an iterative process of (i) 
plasmid insertion and selection followed by (ii) screening for excision and 
target-gene deletion. The plasmid vectors and gene-deletion scheme are 
illustrated in Fig. 3 and described in detail below. 
5 Targeted in-frame deletions are constructed in three steps. In the 

first step, a vector is constructed that contains sequences flanking the region to 
be deleted. In the second step, this vector is inserted into the bacterial 
chromosome by homologous recombination (referred to as a crossover) with 
one of these flanking regions (sequence A or sequence B in Fig. 3). Bacteria 
10 in which the first crossover has occurred are selected for their resistance to 

kanamycin. In the third step, a second crossover occurs with the other flanking 
region, resulting in loss of the inserted vector (including the gene conferring 
resistance to kanamycin) and of the target gene in the chromosome (Fig. 3). 
These three steps are described in more detail below. 

15 Step 1. Construction of a Vector with Flanking Homologies by Crossover PCR 
Primers are designed that amplify regions flanking the gene of 
interest. PCR products of both flanking regions are preferably at least 500 bp, 
and, even more preferably, at least 1 kb. PCR products should contain bases 
that overlap between them so that they can hybridize in the second round of 

20 PCR, plus an integral number of codons from the gene of interest to prevent 
polarity in the deletion. In one example, the following bases are added to the 
3* primer on the N-terminal end of the gene: 

S'-CCCATCCACTAAACTTAAACA -3' (SEQ ID NO: 69), while these 
following bases are added to the 5' primer on the C-terminal end of the gene: 
25 5' -TCirTAAGTTTAGTGGATGGG -3' (SEQ ID NO: 70). During the 
first round of PCR, two products will be made, one for each flanking region. 
The products are isolated on a gel to select the correct sized product 

-33- 



WO 01/79257 



PCT/US01/12230 



During a second PCR step, 1 fil of each flanking region product is 
combined and used as a template. In this reaction, only the outside primers are 
used (i.e., the 5' primer from the amplification if the upstream fragment and 
the 3' primer from the amplification of the downstream fragment), and at a 
5 10-fold higher concentration of each. The templates anneal to each other by 
their complementary bases and, thus, the outside primers will amplify the 
combined product (sequences A and B joined together). The product is 
separated on a gel, the correct-sized band is excised, and the isolated DNA is 
ligated into the vector. 

10 Step 2. Insertion into the Chromosome 

The completed vector, including the insert, is transformed into 
bacterial cells using the standard electroporation protocol with selection for 
kanamycin resistance. PCR and/or Southern blot may be used to verify 
targeted integration of this construct into the chromosome. 

15 Step 3. Isolation of Deletion Mutants 

This protocol is based on replicate plating of the strain with the 
inserted construct. In one example, cultures are first grown overnight in BHI 
at 37°C until they reach stationary phase. The overnight culture is diluted 
100-fold in fresh media and grown at 37°C for an additional 90 minutes. One 

20 hundred microliters from 5- to 10-fold dilution is plated to obtain 

approximately 200 colonies on a TH agar plate (per liter: 30 g TH broth, 15 g 
agar). The colonies are transferred from the original plate onto a piece of 
sterile velvet cloth using a replica plating block and a metal clamp which has 
been pre-marked on one side for orientation. The cells are transferred from the 

25 velvet cloth, first onto a TH agar plate containing the required antibiotic 

marker and then to a plain TH agar plate. All three plates are marked on the 
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corresponding sides to align them with the marking on the metal clamp. The 
plates are then incubated at 37°C for 16 hours. 

The marking on the TH agar plate is aligned with the marking on the 
TH agar selection plate and colonies growing only on the TH agar plate and 
5 not on the selection plate are identified. These cells may be plated onto fresh 
TH agar plates as well as TH agar selection plates for retest. Colonies 
that grow only on TH agar and not on the selection plates are chosen and 
checked for the specific deletion by PCR and Southern blot hybridization. 

In the case of an MDR gene deletion mutant, the drug sensitivity 
10 phenotype is verified by the determination of drug MIC and/or IC^ in 
comparison to that of the wild-type parent strain. 

The vectors and methodology for generating multiple MDR 
knockouts were used to generate a mutant with a deletion in the NorA gene. 
This ANorA mutant can serve as a recipient in transformation experiments 
15 using all the single-gene knockout constructs listed in Table 2. Hence, using 
the methods developed herein we can now rapidly generate two-gene MDR 
knockout mutants. We have generated such mutants (ANorAAAbcl, 
ANorAAAbc2, ANorAAAbcS, ANorAAAbc6, ANorAAAbc7 9 ANorAAAbc8> 
ANorAAAbclO, ANorAAAbcB, ANorAAAbcl6 9 ANorAAAbcl7, 
20 ANorAAAbc22 > ANorAAAbc23 9 ANorAAMf2 9 ANorAAMf3 9 ANorAAMfa 
ANorAA/7, and ANorAAMatel). Significantly, the ANorAAAbcl6 double 
mutant has the predicted phenotype since it exhibits increased dual sensitivity 
to norfloxacin and erythromycin (attributed to the combination of ANorA and 
AAbcl6 9 respectively). The ANorAAAbc7 and ANorAAAbc23 double mutants 
25 also exhibit increased drug sensitivity. 

The foregoing method can be repeated an unlimited number of 
times, thus allowing for the generation of multigene-knockout bacteria (e.g., E. 
faecalis). These bacteria are, in turn, useful for screening of test substances for 
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the identification of novel antibacterial agents, as described in Example 5. The 
method is readily adaptable to any bacterium in which homologous 
recombination can occur. 

Example 5: Use of Multi-MDR Gene-knockout Bacteria for Identification of 
5 Novel Antibacterial Agents 

Bacterial strains having mutations in two or more MDR efflux 
pumps are likely to be hypersensitive to antibacterial agents. Accordingly, 
these strains are useful in assays to identify novel antibacterial compounds. 
Methods of making multi-MDR gene disrupted bacteria and methods for 

10 measuring bacterial cell growth or metabolism are described herein. In the 

screening assays of the present invention, test substances are contacted with the 
bacterial strains for a period of time sufficient to detect any inhibition of 
growth or metabolism. If the test substance that inhibits growth or metabolism 
of a bacterial cell is a collection of compounds, the test substance can be 

15 fractionated using any standard technique to further characterize the active 
compound. If the test substance is a substantially pure compound, this 
compound can be chemically derivatized in order to attempt to find a related 
molecule that has greater activity or that is pumped less efficiently. The assay 
can be adapted for high throughput screening. Methods suitable for high 

20 throughput are described herein. 

Example 6: MDR Efflux Pumps as Drug Targets 

MDR efflux pumps in bacterial cells decrease the sensitivity of the 
cells to antibacterial agents by pumping the agents out of the cells. Thus, a 
compound that blocks efflux of the agent or decreases expression of the pump 
25 will make bacterial cells more sensitive to antibacterial agents. The MDR 
efflux pumps described herein are accordingly novel targets for these 
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compounds. 

Test substances can be screened for their ability to decrease drag 
efflux. In one example, an E. faecalis MDR efflux pump is contacted with a 
test substance, and the ability of a compound in the test substance to bind to 
5 the polypeptide is then determined. Any of a number of binding assays known 
in the art can be performed. In another example, the ability of a test substance 
to decrease expression of an MDR efflux pump is determined. The method 
includes the steps of: (a) providing a cell expressing the MDR efflux pump; (b) 
contacting the cell with the test substance; and (c) measuring expression of the 

10 MDR efflux pump in the cell. Decreased polypeptide expression, relative to a 
cell not contacted with the test substance, indicates that the test substance 
decreases expression of an MDR efflux pump. Expression of the MDR efflux 
pump can be determined by measuring protein levels of the MDR efflux pump 
or by measuring levels of RNA encoding the MDR efflux pump. Finally, 

15 efflux of an antibacterial agent by an MDR efflux pump of the present 

invention can be determined in whole cells in the presence and absence of a 
test substance. In this assay, decreased efflux in the presence of the test 
substance identifies the test substance as containing a compound that blocks 
efflux through the MDR efflux pump. 

20 In a further example, an E. faecalis gene encoding an MDR efflux 

pump is introduced into a surrogate bacterial host (e.g., B. subtilis, L lactis) 
that lacks the MDR gene. The ability of a compound in the test substance to 
block efflux of an antibacterial agent by the MDR efflux pump from the 
surrogate bacterial host can be ascertained by measuring sensitivity to the 

25 antibacterial agent as measured by cell growth. Hence, the MDR efflux pump 
can be expressed in a heterologous bacterial expression host such that the 
introduced MDR efflux pump is the only E. /aecalis-derived gene expressed in 
the surrogate bacterial host. 
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Compounds identified as blocking MDR pump efflux can be used to 
increase the sensitivity of an E. faecalis cell to an antibacterial agent For 
example, the cell can be contacted with a compound that blocks efflux of the 
antibacterial agent from the cell by binding to an MDR efflux pump. 
5 Alternatively, the cell can be contacted with a compound that blocks efflux of 
the antibacterial agent from the cell by decreasing the expression of an MDR 
efflux pump. 

Example 7: Expression of bacterial MDR efflux pumps in non-bacterial cells 
The MDR efflux pumps of the present invention can be expressed in 

10 non-bacterial cells such as yeast cells or mammalian cells (e.g., human cells) in 
order to screen for compounds that block their efflux of antibacterial agents. 
For example, the LmrA MDR gene from L lactus has been functionally 
expressed in GM0637 human lung fibroblast cells (van Veen et al., Nature 
391:391-295, 1998). The lung fibroblast cells expressing the LmrA protein 

15 showed a 10-60 fold increased resistance to a variety of natural product drugs 
and synthetic chemotherapeutic drugs, which are typical substrates of human 
P-glycoprotein. The functional complementation of the human MDR P- 
glycoprotein with an expressed bacterial MDR gene allows the use of 
numerous methods for the discovery and characterization of agents that inhibit 

20 bacterial MDR efflux in mammalian-cell based assays (reviewed in Sharom, J. 
Memb. Biol.l60:161-175, 1997 and Stein, Physiol. Rev. 77:545-590, 1997, 
each of which is hereby incorporated by reference). These approaches involve 
various assay formats for the measurement of the modulation by MDR 
"reversal agents" or blockers of compound efflux mediated by MDR pumps. 

25 Examples include the measurement, in the presence and absence of a test 
substance, of: (i) cytotoxicity mediated by an MDR-pump substrate (e.g. 
doxorubicin); (ii) cellular uptake of a radiolabeled MDR-pump substrate (e.g. 
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[ 3 H]daunorubicin); and (iii) accumulation of a fluorescent MDR-pump 
substrate (e.g. Rhodamine 123) (Bosch et al., Leukemia 11:1131-1137, 1997). 

In another example, the bacterial MDR gene, EmrE from E. coli, has 
been expressed in the yeast Saccharomyces cerevisiae and shown to confer 

5 resistance to a wide variety of drugs, including acriflavin, ethidium, and methyl 
viologen (Yelin et al., J. Bacteriol. 181:949-956, 1999). Although the EmrE 
protein pumped drugs into the vacuolar compartment rather than into the 
exterior of the cell, this report further indicates the feasibility of expressing 
bacterial MDR genes in a non-bacterial background. As with the mammalian 

10 cell-based MDR expression system described previously, lower-eukaryote 

heterologous expression hosts can also be used as the basis for the discovery of 
MDR inhibition agents. 

Example 8: Cell-free assays for the identification of compounds that block 
MDR efflux pumps 

15 In addition to performing screening assays in non-bacterial cells, 

assays can also be performed using cell free assays. The bacterial MDR LmrP 
protein from L lactus has been purified, reconstituted in dodecylmaltoside- 
destabilized preformed liposomes, and shown to mediate the transport of 
multiple drugs in response to an artificially-imposed pH gradient (Putman et 

20 al., Biochemistry 38:1002-1008, (1999). These studies demonstrate the 

applicability of using cell-free systems for the discovery of blockers of drug 
efflux mediated by MDR proteins. The transport assays described above (e.g., 
using radiolabeled [ 3 H]daunorubicin or Rhodamine 123) can be readily 
adapted to the cell free assays. 
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Test Substances 

In general, test substances and compounds are identified from large 
libraries of both natural product and synthetic (or semi-synthetic) extracts or 
chemical libraries according to methods known in the art Those skilled in the 
5 field of drug discovery and development will understand that the precise 
source of the test substance is not critical to the screening procedure(s) of the 
invention. Accordingly, virtually any number of chemical extracts or 
compounds can be screened using the methods described herein. Examples of 
such extracts or compounds include, but are not limited to, plant-, fungal-, 

10 prokaryotic- or animal-based extracts, fermentation broths, and synthetic 
compounds, as well as modification of existing compounds. Numerous 
methods are also available for generating random or directed synthesis (e.g., 
semi-synthesis or total synthesis) of any number of chemical compounds, 
including, but not limited to, saccharide-, lipid-, peptide-, and nucleic acid- 

15 based compounds. In addition, natural and synthetically produced libraries are 
produced, if desired, according to methods known in the art, e.g., by standard 
extraction, fractionation, or synthetic methods. Furthermore, if desired, any 
library or compound is readily modified using standard chemical, physical, or 
biochemical methods. 

20 In addition, those skilled in the art of drug discovery and 

development readily understand that methods for dereplication (e.g., 
taxonomic dereplication, biological dereplication, and chemical dereplication, 
or any combination thereof) or the elimination of replicates or repeats of 
materials already known for their anti-pathogenic activity should be employed 

25 whenever possible. 

When a test substance, as a mixture, is found to inhibit growth or 
metabolism of a bacterial cell, further fractionation of the positive lead extract 
may be necessary to isolate chemical constituents responsible for the observed 
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effect. Thus, the goal of the extraction, fractionation, and purification process 
is the careful characterization and identification of a chemical entity having the 
desired activity within the test substance. Methods of fractionation and 
purification of test substances are known in the art. If desired, test substances 
5 shown to be useful agents for the inhibition of bacterial growth or metabolism 
are chemically modified according to methods known in the art 

: 
I 

Pharmaceutical Therapeutics and Plant Protectants 

The invention provides a simple means for identifying compounds 
(including peptides, small molecule inhibitors, and mimetics) capable of 

10 inhibiting bacterial growth or metabolism. Accordingly, antibacterial agents 
discovered to have medicinal or agricultural value using the methods described 
herein are useful as either drugs, plant protectants, or as information for 
structural modification of existing anti-bacterial compounds, e.g., by rational 
drug design. Such methods are useful for screening compounds having an 

15 effect on a variety of bacteria. 

For therapeutic uses, the compositions or agents identified using the 
methods disclosed herein may be administered systemically, for example, 
formulated in a pharmaceutically-acceptable buffer such as physiological 
saline. Suitable routes of administration include, for example, orally, by 

20 inhalation, or by subcutaneous, intravenous, interperitoneal, intramuscular, or 
intradermal injections which provide continuous, sustained levels of the drug 
in the patient. Treatment of human patients or other animals will be carried out 
using a therapeutically effective amount of an anti-bacterial agent in a 
physiologically-acceptable carrier. The amount of the anti-bacterial agent to be 

25 administered varies depending upon the manner of administration, the age and 
body weight of the patient, and with the type of disease and extensiveness of 
the disease. Generally, amounts will be in the range of those used for other 
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agents used in the treatment of other bacterial diseases, although in certain 
instances lower amounts will be needed because of the increased specificity of 
the compound. A compound is administered at a dosage that inhibits bacterial 
growth or metabolism. For example, for systemic administration a compound 
5 is administered typically in the range of 0. 1 fig. to 10 g/kg body weight. 

In addition, the antibacterial agent may be added to materials used to 
make catheters, including but not limited to intravenous, urinary, 
intraperitoneal, ventricular, spinal and surgical drainage catheters, in order to 
prevent colonization and systemic seeding by potential pathogens. Similarly, 
* 10 the antibacterial agent may be added to the materials that constitute various 
surgical prostheses and to dentures to prevent colonization by pathogens and 
thereby prevent more serious invasive infection or systemic seeding by 
pathogens. 

For agricultural uses, the compositions or agents identified using the 
15 methods disclosed herein may be used as chemicals applied as sprays or dusts 
on the foliage of plants, or in irrigation systems. Typically, such agents are to 
be administered on the surface of the plant in advance of the pathogen in order 
to prevent infection. Seeds, bulbs, roots, tubers, and conns are also treated to 
prevent pathogenic attack after planting by controlling pathogens carried on 
20 them or existing in the soil at the planting site. Soil to be planted with 
vegetables, ornamentals, shrubs, or trees can also be treated with chemical 
fiunigants for control of a variety of bacterial pathogens. Treatment can be 
done several days or weeks before planting. The chemicals can be applied by 
either a mechanized route, e.g., a tractor or with hand applications. In addition, 
25 chemicals identified using the methods of the assay can be used as 
disinfectants. 
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Other Embodiments 

The present invention has been described in terms of particular 
embodiments found or proposed by the present inventors to include exemplary 
modes for the practice of the invention. It will be appreciated by those of skill 
5 in the art that, in light of the present disclosure, numerous modifications and 
changes can be made in the particular embodiments exemplified without 
departing from the intended scope of the invention. All such modifications are 
intended to be included within the scope of the invention. 

What we claim is: 
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1. A method of determining whether a candidate nucleotide 
sequence encodes an MDR efflux pump, said method comprising the steps of: 

(a) searching a database of nucleotide sequences for sequences 
having high identity to a sequence encoding a known MDR efflux pump to 

5 generate a first set of one or more candidate sequences comprising sequences 
that have high identity to the sequence encoding the known MDR efflux pump; 

(b) from the first set of candidate sequences, selecting one or more 
sequences that include a sequence encoding one or more potential 
transmembrane domains to generate a second set of candidate sequences; 

10 (c) in a bacterial cell, mutating a gene corresponding to one of the 

second set of candidate sequences; and 

(d) detennining whether the bacterial cell exhibits increased 
sensitivity to an antibacterial agent, wherein the candidate sequence encodes an 
MDR efflux pump if the bacterial cell exhibits increased sensitivity to the 

15 antibacterial agent 
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2. A method of determining whether a polypeptide sequence 
functions as an MDR efflux pump, said method comprising the steps of: 

(a) searching a database of polypeptide sequences for sequences 
having high identity to a polypeptide sequence known to function as an MDR 

5 efflux pump to generate a first set of one or more candidate polypeptides; 

(b) from the first set of candidates, selecting one or more that have 
potential transmembrane domains to generate a second set of candidate 
polypeptides; 

(c) in a bacterial cell, mutating a gene encoding one of the second set 
10 of candidate polypeptides; and 

(d) determining whether the bacterial cell exhibits decreased 
sensitivity to antibacterial agents, wherein the candidate polypeptide functions 
as an MDR efflux pump if the cell exhibits decreased sensitivity to an 
antibacterial agent. 
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3. A method for deleting a desired region of DNA in a bacterial cell, 
said method comprising the steps of: 

(a) transforming bacterial cells with a vector comprising (i) a first 
region of at least 30 nucleotides substantially identical to a first region of 
5 chromosomal DNA in the bacterial cells; (ii) a second region of at least 30 
nucleotides substantially identical to a second region of chromosomal DNA in 
the bacterial cells; and (iii) a third region encoding a polypeptide that provides 
resistance to a selection agent, wherein the first and second regions of 
chromosomal DNA are on opposite sides of the region of DNA to be deleted; 

10 (b) selecting for bacterial cells in which the vector of step (a) has 

integrated by a first crossover event into the chromosomal DNA by adding the 
selection agent to the culture medium; and 

(c) culturing the cells selected for in step (b) in the absence of a 
selection agent to allow for a second crossover event to occur in at least a 

15 subset of the cells, wherein the second crossover event results in the loss of (i) 
the desired region of DNA and (ii) the region encoding the polypeptide that 
provides resistance to the selection agent in that subset of cells. 

4. The method of claim 3, wherein steps (a)-(c) are repeated one or 
more times to delete additional regions of DNA in the same bacterial cell from 

20 which the first region was deleted. 
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5. A method for deleting two regions of DNA in a bacterial cell, 
said method comprising the steps of: 

(a) transforming bacterial cells with a vector comprising (i) a first 
region of at least 30 nucleotides substantially identical to a first region of 

5 chromosomal DNA in the bacterial cells; (ii) a second region of at least 30 
nucleotides substantially identical to a second region of chromosomal DNA in 
the bacterial cells; and (iii) a third region encoding a polypeptide that provides 
resistance to a selection agent, wherein the first and second regions of 
chromosomal DNA are on opposite sides of a first region of DNA to be 
10 deleted; 

(b) selecting for bacterial cells in which the vector of step (a) has 
integrated by a first crossover event into the chromosomal DNA by adding the 
selection agent to the culture medium; 

(c) culturing the cells selected for in step (b) in the absence of a 
15 selection agent to allow for a second crossover event to occur in at least a 

subset of the cells, wherein the second crossover event results in the loss of (i) 
the first region of DNA and (ii) the region encoding the polypeptide that 
provides resistance to the selection agent in that subset of cells; 

(d) transforming bacterial cells resulting from step (c) with a vector 
20 comprising (i) a first region of at least 30 nucleotides substantially identical to 

a first region of chromosomal DNA in the bacterial cells; (ii) a second region 
of at least 30 nucleotides substantially identical to a second region of 
chromosomal DNA in the bacterial cells; and (iii) a third region encoding a 
polypeptide that provides resistance to a selection agent, wherein the first and 
25 second regions of chromosomal DNA are on opposite sides of a second region 
ofDNAtobe deleted; 

(e) selecting for bacterial cells in which the vector of step (d) has 
integrated by a first crossover event into the chromosomal DNA by adding the 
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selection agent to the culture medium; 

(f) culturing the cells selected for in step (e) in the absence of a 
selection agent to allow for a second crossover event to occur in at least a 
subset of the cells, wherein the second crossover event results in the loss of (i) 
5 the second region of DNA and (ii) the region encoding the polypeptide that 
provides resistance to the selection agent in that subset of cells. 

6. The method of claim 5, wherein said bacterial cell is selected 
from the group consisting of Staphylococcus aureus, Streptococcus pyogenes \ 
Bacillus anthracis, Clostridium tetani, Clostridium bolulinum, Vibrio 
10 cholerae, Helicobacter pylori, Salmonella typhimurium, Shigella dysenteriae, 
Bordetella pertussis, Yersinia pestis, Pseudomonas aeruginosa, Neisseria 
gonorrhoeae, Mycobacterium tuberculosis, Corynebacterium diptheriae, 
Borrelia burdorferi, Treponema pallidum, Enterococcus faecalis, 
Enterococcus faeciwn, and Streptococcus pneumoniae. 
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7. A method for determining whether a test substance inhibits the 
growth or metabolism of cells of a strain of Enterococcus faecalis having a 
disruptive mutation in a gene encoding a protein selected from the group 
consisting of Abel (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID 
5 NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID NO: 
12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), 
AbclO (SEQ ID NO: 20), Abel 1 (SEQ ID NO: 22), Abcl2 (SEQ ID NO: 24), 
Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), Abcl5 (SEQ ID NO: 30), 
Abcl6 (SEQ ID NO: 32), Abcl7 (SEQ ID NO: 34), Abcl8 (SEQ ID NO: 36), 

10 Abcl9 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), 
Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), 
NorA (SEQ ID NO: 50), Mfl (SEQ ID NO: 52), M£2 (SEQ ID NO: 54), Mf3 
(SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ 
ID NO: 62), Mf7 (SEQ ID NO: 64), Matel (SEQ ID NO: 66), and Smrl (SEQ 

15 ID NO: 68), said method comprising the steps of: 

(a) contacting the cells with a test substance; and 

(b) determining whether the growth or metabolism of the cells is 
inhibited, wherein an inhibition of growth or metabolism, relative to the 
growth or metabolism of cells of the strain of bacteria having the mutation but 

20 not contacted with the test substance, identifies the test substance as one 
having the potential use as an antibacterial agent. 
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8. The method of claim 7, wherein the bacterial strain has disruptive 
mutations in genes encoding at least two of the proteins selected from the 
group consisting of Abel (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ 
ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 (SEQ ID 
5 NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID 
NO: 18), Abe 10 (SEQ ID NO: 20), Abel 1 (SEQ ED NO: 22), Abcl2 (SEQ ID 
NO: 24), Abcl3 (SEQ ED NO: 26), Abcl4 (SEQ ED NO: 28), Abcl5 (SEQ ED 
NO: 30), Abcl6 (SEQ ED NO: 32), Abcl7 (SEQ ED NO: 34), Abcl8 (SEQ ED 
NO: 36), Abcl9 (SEQ ED NO: 38), Abc20 (SEQ ED NO: 40), Abc21 (SEQ ED 
10 NO: 42), Abc22 (SEQ ED NO: 44), Abc23 (SEQ ED NO: 46), Bmr (SEQ ED 
NO: 48), NorA (SEQ ED NO: 50), Mfl (SEQ ED NO: 52), Mf2 (SEQ ED NO: 
54), Mf3 (SEQ ED NO: 56), Mf4 (SEQ ED NO: 58), Mf5 (SEQ ED NO: 60), 
Mf6 (SEQ ED NO: 62), Mf7 (SEQ ED NO: 64), Matel (SEQ ED NO: 66), and 
Smrl (SEQ ID NO: 68). 

15 9. A substantially pure nucleic acid molecule consisting essentially 

of Enterococcusfaeccdis Abel (SEQ ED NO: 1), Abc2 (SEQ ED NO: 3), Abc3 
(SEQ ED NO: 5), Abc4 (SEQ ED NO: 7), Abc5 (SEQ ED NO: 9), Abc6 (SEQ ED 
NO: 11), Abc7 (SEQ ED NO: 13), Abc8 (SEQ ED NO: 15), Abc9 (SEQ ED NO: 
17), AbclO (SEQ ED NO: 19), Abcll (SEQ ED NO: 21), Abcl2 (SEQ ED NO: 

20 23), Abcl3 (SEQ ED NO: 25), AbcU (SEQ ED NO: 27), AbclS (SEQ ED NO: 
29), Abcl6 (SEQ ED NO: 31), AbcU (SEQ ED NO: 33), Abcl8 (SEQ ED NO: 
35), Abcl9 (SEQ ED NO: 37), Abc20 (SEQ ED NO: 39), Abcll (SEQ ED NO: 
41), Abc22 (SEQ ED NO: 43), Abc23 (SEQ ED NO: 45), Bmr (SEQ ED NO: 
47), NorA (SEQ ED NO: 49), Mfl (SEQ ED NO: 51), Mf2 (SEQ ED NO: 53), 

25 Mf3 (SEQ ED NO: 55), Mf4 (SEQ ID NO: 57), Mf5 (SEQ ED NO: 59), Mf6 
(SEQ ED NO: 61), M/7 (SEQ ED NO: 63), Matel (SEQ ED NO: 65), or Smrl 
(SEQ ED NO: 67). 
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10. A substantially pure polypeptide comprising a sequence selected 
from the group consisting of Enterococcus faecalis Abel (SEQ ID NO: 2), 
Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 
(SEQ ID NO: 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 

5 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abe 10 (SEQ ID NO: 20), Abel 1 
(SEQ ID NO: 22), Abcl2 (SEQ ID NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 
(SEQ ID NO: 28), Abcl5 (SEQ ID NO: 30), Abcl6 (SEQ ED NO: 32), Abcl7 
(SEQ ID NO: 34), Abel 8 (SEQ ED NO: 36), Abcl9 (SEQ ED NO: 38), Abc20 
(SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ED NO: 44), Abc23 
10 (SEQ ED NO: 46), Bmr (SEQ ED NO: 48), NorA (SEQ ID NO: 50), Mfl (SEQ 
ID NO: 52), Mf2 (SEQ ED NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID 
NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 
64), Matel (SEQ ED NO: 66), and Smrl (SEQ ED NO: 68). 

11. An Enterococcus faecalis bacterial cell having a disruptive 
15 mutation in a gene encoding a protein selected from the group consisting of 

Abel (SEQ ED NO: 2), Abc2 (SEQ ED NO: 4), Abc3 (SEQ ID NO: 6), Abc4 
(SEQ ID NO: 8), Abc5 (SEQ ED NO: 10), Abc6 (SEQ ED NO: 12), Abc7 
(SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ED NO: 18), AbclO 
(SEQ ED NO: 20), Abel 1 (SEQ ED NO: 22), Abcl2 (SEQ DD NO: 24), Abcl3 

20 (SEQ ED NO: 26), Abcl4 (SEQ DD NO: 28), Abcl5 (SEQ ED NO: 30), Abcl6 
(SEQ DD NO: 32), Abcl7 (SEQ DD NO: 34), Abcl8 (SEQ ED NO: 36), Abcl9 
(SEQ ED NO: 38), Abc20 (SEQ DD NO: 40), Abc21 (SEQ DD NO: 42), Abc22 
(SEQ DD NO: 44), Abc23 (SEQ DD NO: 46), Bmr (SEQ DD NO: 48), NorA 
(SEQ DD NO: 50), Mfl (SEQ ED NO: 52), Mf2 (SEQ DD NO: 54), Mf3 (SEQ 

25 ED NO: 56), Mf4 (SEQ DD NO: 58), Mf5 (SEQ ED NO: 60), Mf6 (SEQ DD 
NO: 62), Mf7 (SEQ ED NO: 64), Matel (SEQ DD NO: 66), and Smrl (SEQ DD 
NO: 68). 
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12. The bacterial cell of claim 11, wherein the bacterial strain has 
disruptive mutations in genes encoding at least two of the proteins selected 
from the group consisting of Abel (SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), 
Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 10), Abc6 
5 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), Abc9 
(SEQ ID NO: 18), AbclO (SEQ ID NO: 20), Abel 1 (SEQ ID NO: 22), Abcl2 
(SEQ ID NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), Abcl5 
(SEQ ID NO: 30), Abcl6 (SEQ ID NO: 32), Abcl7 (SEQ ID NO: 34), Abcl8 
(SEQ ID NO: 36), Abcl9 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), Abc21 
10 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), Bmr 
(SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mf 1 (SEQ ID NO: 52), Mf2 (SEQ 
ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ED 
NO: 60), Mf6 (SEQ ED NO: 62), Mf7 (SEQ ED NO: 64), Matel (SEQ ED NO: 
66), and Smrl (SEQ ED NO: 68). 
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13. A method for determining whether a test substance includes a 
compound that potentially blocks efflux of antibacterial agents from a cell, said 
method comprising the steps of: 

(a) providing a polypeptide selected from the group consisting of 
5 Enterococcusfaecalis Abel (SEQ ED NO: 2), Abc2 (SEQ ID NO: 4), Abc3 

(SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ED NO: 10), Abc6 (SEQ 
ED NO: 12), Abc7 (SEQ ED NO: 14), Abc8 (SEQ ED NO: 16), Abc9 (SEQ ED 
NO: 18), AbclO (SEQ ID NO: 20), Abcll (SEQ ID NO: 22), Abcl2 (SEQ ED 
NO: 24), Abcl3 (SEQ ED NO: 26), Abcl4 (SEQ ID NO: 28), Abcl5 (SEQ ED 

10 NO: 30), Abcl6 (SEQ ED NO: 32), Abcl7 (SEQ ID NO: 34), Abcl8 (SEQ ID 
NO: 36), Abcl9 (SEQ ED NO: 38), Abc20 (SEQ ED NO: 40), Abc21 (SEQ ED 
NO: 42), Abc22 (SEQ ED NO: 44), Abc23 (SEQ ED NO: 46), Bmr (SEQ ED 
NO: 48), NorA (SEQ ID NO: 50), Mfl (SEQ ED NO: 52), Mf2 (SEQ ED NO: 
54), Mf3 (SEQ ED NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ED NO: 60), 

15 Mf6 (SEQ ID NO: 62), Mf7 (SEQ ED NO: 64), Matel (SEQ ID NO: 66), and 
Smrl (SEQ ED NO: 68); 

(b) contacting the polypeptide with the test substance; and 

(c) determining whether a compound in the test substance binds to 
the polypeptide, wherein binding of the compound to the polypeptide identifies 

20 it as a compound that potentially blocks efflux of antibacterial agents from a 
cell. 

14. The method of claim 13, wherein said method is performed in 

vitro. 

15. The method of claim 13, wherein said method is performed in 

25 vivo. 
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16. The method of claim 13, wherein said polypeptide is expressed 
in a strain other than a strain of Enterococcus faecalis. 

17. The method of claim 13, wherein the strain is a strain of 
Bacillus subtilis or Lactococcus lactis. 

5 

18. A method for determining whether a test substance includes a 
compound that reduces efflux of antibacterial agents from a cell, said method 
comprising the steps of: 

(a) providing a cell expressing a gene encoding an MDR efflux 

10 pump selected from Enterococcus faecalis Abel (SEQ ID NO: 2), Abc2 (SEQ 
ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 
10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), 
Abc9 (SEQ ID NO: 18), AbclO (SEQ ID NO: 20), Abcll (SEQ ID NO: 22), 
Abcl2 (SEQ ID NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), 

15 Abcl5 (SEQ ID NO: 30), Abcl6 (SEQ ID NO: 32), Abcl7 (SEQ ID NO: 34), 
Abcl8 (SEQ ID NO: 36), Abcl9 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), 
Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), 
Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mfl (SEQ ID NO: 52), Mf2 
(SEQ ID NO: 54), Mf3 (SEQ ED NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ 

20 ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Matel (SEQ ID 
NO: 66), and Smrl (SEQ ID NO: 68); 

(b) contacting the cell with the test substance; and 

(c) measuring expression of the gene encoding the MDR efflux 
pump in the cell, wherein decreased expression, relative to a cell not contacted 

25 with the test substance, identifies the test substance as containing a compound 
that reduces efflux of antibacterial agents from a cell. 
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19. The method of claim 18, wherein expression of the gene 
encoding the MDR efflux pump is determined by measuring protein levels of 
the MDR efflux pump. 

20. The method of claim 18, wherein expression of gene encoding 

5 the MDR efflux pump is determined by measuring levels of RNA encoding the 
MDR efflux pump. 

21. The method of claim 18, wherein said MDR efflux pump is 
expressed in a strain other than a strain of E. faecalis. 

22. The method of claim 21, wherein the strain is a strain of B. 
10 subtilis or L lactis. 
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23. A method for increasing the sensitivity of an E. faecalis cell to 
an antibacterial agent, said method comprising the step of contacting the cell 
with a compound that blocks efflux of the antibacterial agent from the cell by 
binding to an MDR efflux pump selected from the group consisting of E. 
5 faecalis Abel (SEQ ED NO: 2), Abc2 (SEQ ED NO: 4), Abc3 (SEQ ED NO: 6), 
Abc4 (SEQ ED NO: 8), Abc5 (SEQ ED NO: 10), Abc6 (SEQ ED NO: 12), Abc7 
(SEQ ED NO: 14), Abc8 (SEQ ED NO: 16), Abc9 (SEQ ED NO: 18), AbclO 
(SEQ ED NO: 20), Abcll (SEQ ED NO: 22), Abcl2 (SEQ ED NO: 24), Abcl3 
(SEQ ID NO: 26), Abe 14 (SEQ ED NO: 28), Abcl5 (SEQ ED NO: 30), Abcl6 

10 (SEQ ED NO: 32), Abcl7 (SEQ ED NO: 34), Abcl8 (SEQ ED NO: 36), Abcl9 
(SEQ ID NO: 38), Abc20 (SEQ ED NO: 40), Abc21 (SEQ ED NO: 42), Abc22 
(SEQ ID NO: 44), Abc23 (SEQ ED NO: 46), Bmr (SEQ ED NO: 48), NorA 
(SEQ ED NO: 50), Mfl (SEQ ED NO: 52), Mf2 (SEQ ED NO: 54), Mf3 (SEQ 
ID NO: 56), Mf4 (SEQ ED NO: 58), Mf5 (SEQ ED NO: 60), Mf6 (SEQ ED 

15 NO: 62), Mf7 (SEQ ED NO: 64), Matel (SEQ ID NO: 66), and Smrl (SEQ ED 
NO: 68). 
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24. A method for increasing the sensitivity of an E. faecalis cell to 
an antibacterial agent, said method comprising the step of contacting the cell 
with a compound that blocks efflux of the antibacterial agent from the cell by 
decreasing the expression of a gene encoding an MDR efflux pump selected 
5 from the group consisting of E. faecalis Abel (SEQ ID NO: 2), Abc2 (SEQ ID 
NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID NO: 
10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), 
Abc9 (SEQ ID NO: 18), AbclO (SEQ ID NO: 20), Abcll (SEQ ID NO: 22), 
Abcl2 (SEQ ID NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), 

10 Abcl5 (SEQ ID NO: 30), Abcl6 (SEQ ID NO: 32), Abcl7 (SEQ ID NO: 34), 
Abcl8 (SEQ ID NO: 36), Abcl9 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), 
Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), 
Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mfl (SEQ ID NO: 52), Mf2 
(SEQ ID NO: 54), Mf3 (SEQ ID NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ 

15 ID NO: 60), Mf6 (SEQ ED NO: 62), Mf7 (SEQ ID NO: 64), Matel (SEQ ID 
NO: 66), and Smrl (SEQ ID NO: 68). 
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25. A method for determining whether a test substance functions as 
an inhibitor of an MDR efflux pump, said method comprising the steps of: 

(a) expressing a nucleic acid molecule encoding an MDR efflux 
pump derived from a first bacterial strain in a second bacterial strain to provide 

5 increased resistance to an antibacterial agent relative to the second bacterial 
strain not expressing the MDR efflux pump; 

(b) contacting the second bacterial strain with an amount of the 
antibacterial agent that inhibits growth of a control strain but does not 
substantially inhibit growth of the second bacterial strain; 

10 (c) contacting the second bacterial strain with a test substance; and 

(d) measuring growth of second bacterial strain, wherein decreased 
growth of the bacterial strain identifies the test substance as one that includes 
an inhibitor of an MDR efflux pump. 

15 26. The method of claim 25, wherein the second bacterial strain is a 

strain of Bacillus subtilis or Lactococcus lactis. 
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27. A method for determining whether a test substance includes a 
compound that reduces efflux of antibacterial agents from a cell, said method 
comprising the steps of: 

(a) providing a non-bacterial cell expressing a nucleic acid molecule 
5 encoding an MDR efflux pump selected from Enterococcus faecalis Abel 

(SEQ ID NO: 2), Abc2 (SEQ ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ 
ID NO: 8), Abc5 (SEQ ID NO 10), Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID 
NO: 14), Abc8 (SEQ ID NO: 16), Abc9 (SEQ ID NO: 18), Abe 10 (SEQ ID 
NO: 20), Abcll (SEQ ID NO: 22), Abcl2 (SEQ ID NO: 24), Abcl3 (SEQ ID 

10 NO: 26), Abcl4 (SEQ ID NO: 28), Abcl5 (SEQ ID NO: 30), Abcl6 (SEQ ID 
NO: 32), Abcl7 (SEQ ID NO: 34), Abcl8 (SEQ ID NO: 36), Abcl9 (SEQ ID 
NO: 38), Abc20 (SEQ ID NO: 40), Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID 
NO: 44), Abc23 (SEQ ID NO: 46), Bmr (SEQ ID NO: 48), NorA (SEQ ID 
NO: 50), Mfl (SEQ ID NO: 52), Mf2 (SEQ ID NO: 54), Mf3 (SEQ ID NO: 

15 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ ID NO: 60), Mf6 (SEQ ID NO: 62), 
Mf7 (SEQ ID NO: 64), Matel (SEQ ID NO: 66), and Smrl (SEQ ID NO: 68); 

(b) contacting the cell with the test substance; 

(c) contacting the cell with a compound that is capable of (i) 
entering the cell and (ii) being transported by the MDR efflux pump; and 

20 (d) measuring efflux of the compound from the cell, wherein 

decreased efflux, relative to a cell not contacted with the test substance, 
identifies the test substance as containing a compound that reduces efflux of 
antibacterial agents from a cell. 

28. The method of claim 27, wherein the cell is a eukaryotic cell. 
25 29. The method of claim 28, wherein the cell is a mammalian cell. 



-59- 



WO 01/79257 PCTAJS01/12230 

30. The method of claim 28, wherein the cell is an insect cell. 

31. The method of claim 28, wherein the cell is a yeast cell. 

32. The method of claim 31, wherein the yeast is Saccharomyces 
cerevisiae. 
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33. A method for determining whether a test substance includes a 
compound that reduces efflux of antibacterial agents from a cell, said method 
comprising the steps of: 

(a) providing a cell free system consisting of: 

5 (i) a lipid membrane into which is inserted an MDR efflux 

pump selected from Enterococcus faecalis Abel (SEQ ID NO: 2), Abc2 (SEQ 
ID NO: 4), Abc3 (SEQ ID NO: 6), Abc4 (SEQ ID NO: 8), Abc5 (SEQ ID 10), 
Abc6 (SEQ ID NO: 12), Abc7 (SEQ ID NO: 14), Abc8 (SEQ ID NO: 16), 
Abc9 (SEQ ID NO: 18), AbclO (SEQ ID NO: 20), Abel 1 (SEQ ID NO: 22), 

10 Abcl2 (SEQ ID NO: 24), Abcl3 (SEQ ID NO: 26), Abcl4 (SEQ ID NO: 28), 
AbclS (SEQ ID NO: 30), Abcl6 (SEQ IDNO: 32), Abcl7 (SEQ ID NO: 34), 
Abcl8 (SEQ ID NO: 36), Abcl9 (SEQ ID NO: 38), Abc20 (SEQ ID NO: 40), 
Abc21 (SEQ ID NO: 42), Abc22 (SEQ ID NO: 44), Abc23 (SEQ ID NO: 46), 
Bmr (SEQ ID NO: 48), NorA (SEQ ID NO: 50), Mfl (SEQ ID NO: 52), Mf2 

15 (SEQ ID NO: 54), Mf3 (SEQ ED NO: 56), Mf4 (SEQ ID NO: 58), Mf5 (SEQ 
ID NO: 60), Mf6 (SEQ ID NO: 62), Mf7 (SEQ ID NO: 64), Matel (SEQ ID 
NO: 66), and Smrl (SEQ ID NO: 68); 

(ii) the test substance; . 

(iii) a compound that is capable of being transported by the 
20 MDR efflux pump; and 

(b) measuring transport of the compound across the lipid membrane, 
wherein decreased transport, relative to transport in a cell-free system not 
comprising the test substance, identifies the test substance as containing a 
compound that reduces efflux of antibacterial agents from a cell. 

25 34. The method of claim 33, wherein said lipid membrane is in the 

form of a liposome. 
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5' ATG TTT GGT TTA TTG AAA TAC GCA AAA AAT TAT CGA AAA CAA ATT ATC CTC GGT 

MFG LLKYAKNYRKQI ILG 
CCA GTC TTC AAA TTT TTA GAA GCT TGT TTC GAG TTA GTC TTG CCC TTA TTC ATG 

pVFKFIiEACFELVLPLFM 
GCT CAC TTA GTC GAT GTC GGT ATT CGT CAA AAC GAC CGA CAG ACT GTC ATA GAA 

AHLVDVGIRONDRQTVIB 
ATG GCC CTC TGG ATG CTT GTT ATG TCG TTA GTT GGT TTA TTT TTT GTC ATG ATT 

MALWMLVMSLVGLFFVMI 
TGT CAA TAC TAT GCT TCA GTC GCT TCG CAA GGT TTT GGG ACG GAG CTA AGA AAT 

CQYYASVASQGFGTBLRN 
CAA TTA ATG AAG AAA ATT AAT CAG CTT TCA CAC AAA GAA TTG AAT AGT TTT GGT 

QLMKKINQLSHKELNSFG 
ACA GAT ACC CTC ATC ACC CGG ATC ACA AAC GAT ATC AAC CAG CTT CAA TTA GCT 

TDTLITRITNDINQLQLA 
TTA GCG ATG TTT ATT CGG TTG GTC ATT CGG ACA CCT TTT TTA AGT ATC GGT TCT 

LAMFIRIiVIRTPFLSIGS 
GTG GTG ATG GCT TTT TAC ATT GAC GTG CAG ATG GGC TTT CTT TTC CTA TTA CTT 

VVMAFYIDVQMGFLFLLL 
TTA CCA ATT TTT AGC CTT ATT CTC TTT ATT ATC ATT AAA GTG ACT GTG CCT TTA 

LPIFSLIIiF.IIIKVTVPL 
TAT CAA AAA GTC CAA GAA TAT TTG GAT CGG TTA AAC CGT CAA ATC AGT CAA AAC 

YQKVQEYLDRLNRQI SQN 
TTA AGC GGT GTC CGT GTG ATC CGT GCG TTT GCT AGA AAG GAA ACA GAG CAA CGA 

LSGVRVIRAFARKETEQR 
CAT GTT GAT AAA GCT TCA GAT GAT TTA GGC GAT ATT TAC ATT CGT GTA TCG AAT 

HVDKASDDLGDIYIRVSN 
GTC TCT GCT TTA TTA ACG CCT TTA ACC ACT TTG ATT ATG AAT GTT GGA ATT TTA 

VSALLTPLTTLIMNVGIL 
TTT TTA CTT TAT TTT AGT GGC TTA AAA GTT TCT TTT GGT TCC TTA CAA CAA GGG 

FLLiYFSGLKVS FGSLQQG 
GAA GTT TTA GCA TTG ATC AAT TAT ATG AAT CAA ATG ATG CTC GCT TTA ATT GTT 

EVLALINYMNQMMLALIV 



Fig. 4A 
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GCT TCT AAT CTC GTG GTA ATT TTT ACG CGA GCT GCC GCT TCC GCA AAC CGT GTC 

ASNLVVIFTRAAASANRV 
AAT GAG GTT TTA ACT GTA GAA AGC CAG TTA ACA GAT ACA CCA GAA TCA GCA AAA 

NEVLTVESQLTDTPESAK 
ACA TCT CCA CAA TTT GGT GAT ATT ACC TTT GAC CAT GTA GAT TTT CGC TAT GAA 

TSPQFGDITFDHVDFRYE 
CCA GAG GCC GGT TTG GCT TTG GAA AAT ATC AAT TTT ACG ATT CCT AAA GGC TCA 

PEAGLALENINFTIPKGS 
ATT CTC GGC ATC ACA GGA CCT ACC GGC AGC GGG AAA AGT ACC TTA ACC CAA CTC 

ILGITGPTGSGKSTIjTQL 
ATT CCC AGA TTT TAT GAT GTG AGT GCA GGA AAC CTT TTT ATT AAT GGT GTA AAT 

IPRFYDVSAGNLFINGVN 
GTG CGC GAT TGG CCG CTC TTT ACT TTA CGC CAA CAA GTT GCA AGT GTT CCA CAA 

VRDWP Ij FTLRQQVASVPQ 
ACT GCC GTC TTA TTT ACA GGG ACT ATT CGA GAA AAC TTA CAA TGG GGC AAA CCA 

TAVIiFTGTIRENLQWGKP 
AAT GCA ACT GAT GAA GAC TGT TGG GAA GCA TTA GCT ATC GCC CAA TGT AAA GAA 

NATDEDCWEALAIAQCKB 
TTT GTT GAA CAA TTA GAC CAA GGG TTA GAC ACG CCT GTT AAC GAA GGC GGA AAA 

FVEQLDQGIiDTPVNEGGK 
AAC TTT TCT GGT GGA CAA AGA CAA CGG TTG ACC ATT GCC CGT GCC TTG ATT AGG 

NFSGGQRQRLTIARALIR 
AAA CCG CAT TTA CTT ATT TTA GAT GAT TCC TTG AGT GCC CTC GAC TAT CAA ACA 

KPHLLI LDDSLSALDYQT 
GAT TTG AAT CTG CGT CGT GCT TTA CAA AAA GAA CGA GCA GAG ACA ACC GTT ATT 

DLNLRRALQKERAETTVI 

TTA ATT TCA CAA CGC GTG AGT TCG ATT GCA ACG GCG AAT CAA ATT TTA GTC CTA 
LISQRVSSIATANQILVL 

GAT AGT GGA AAA GTC GCT GGC CTT GGC ACC CAC GAA GAA TTA CTT ACT TCT TCT 
DSGKVAGLGTHEELLTSS 

AAA GAG TAT CAA GAA ATC GTT GCG TCA CAG GAG GAG GAT ACC CAT GCA AAC TAA 3 1 
KEYQE IVASQEBDTHAN* 

Fig. 4B 
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AGC CAA CCA ATG GGG AAT ATT ACG CAG TTG TCT GCA GCT TTA CAA AGC GCT TCA 

SQPMGNITQLSAALQSAS 
GCT TCG ACC ATG CGG ATT TTT GAA ATC CTA GAT GAA CCA GAA GAA GAA CTT AAC 

ASTMRIFEILDEPE EEIiN 
GAA CAA GAT GTT CCT TTG CCA GAA CCT ATT TTA GGC TCT GTT GAA TTT GAA AAT 

EQDVPLPEPILGSVEFEN 
GTC AGC TTT AGT TAT GAC CCA GAA AAA CCG TTA ATT CGT AAT TTG AAC TTT AAA 

VSPSYDPEKPLIRNLNPK 
GTT GAT GCG GGC CAA ATG GTT GCG ATT GTG GGA CCA ACT GGC GCT GGG AAA ACA 

V.DAG QMVA I VG P TG A G KT 
ACC TTA ATC AAC TTA CTG ATG CGT TTT TAT GAT GTA ACA GAA GGC GCC ATT AAA 

TLINLLMRFYDVTEGA IK 
ATT GAT GGC ATT GAC ACG AAA AAA ATG AAC CGT AGT GAT GTC CGA TCT GTA TTT 

IDGIDTKKMNRSDVRSVF 
GGA ATG GTA TTG CAA GAT GCT TGG TTG TAT AAA GGT ACC ATT GCA GAT AAC ATT 

GMVLQDAWLYKGTIADNI 
CGT TTT GGG AAG TTA GAT GCC ACG GAT TAT GAA GTT GTC GAT GCA GCG AAA ACG 

RFGKLDATDYEVVDAAKT 
GCC AAT GTG GAT CAC TTC ATT CGG ACA ATG CCA GAC GGG TAT GAA ATG GAA ATC 

ANVDHFIRTMPDGYEMEI 
AAT TCT GAG GGA GAT AAC GTT TCC CTT GGT CAA AAA CAA TTG TTG ACC ATT GCC 

NSEGDNVSLGQKQLLTIA 
CGA GCG GTA ATT TCT GAT CCG AAA ATT TTG ATT TTA GAT GAG GCG ACT AGT TCA 

RAVI SDPKILI LDEATSS 
GTC GAT ACA CGC TTG GAA GCC TTA ATT CAA AAA GCA ATG GAT CGT GTT ATG GAA 

VDTRLEAliIQKAMDRVME 

GGA CGA ACG AGT TTC GTT ATT GCC CAC CGC CTA TCA ACT ATT CGT GAA GCT GAT 
GRTSFVIAHRLSTIREAD 

TTA ATT CTT GTT ATG AAA CAA GGA GAA ATC ATT GAA AAA GGT ACG CAT CAT GAG 
LI LVMKQGEI I E KGTHHE 

TTG CTG GAA CAA GGT GGC TTC TAT GAA AAA CTA TAC AAT AGT CAA TTT GCT GAA 
LLEQGGFYEKLYNSQFAE 
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GAA GGC GAC TAT GAG GAA TAA 3 1 
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5 1 ATG ttt TAC A CA GCT ATT CTC TTT GCA TTT GGC TTA TCA ATT ATC AAT ATT CTT 
MPYTAILFAFGLSI INIL 
TTG CCT AGA GTC ATT CAA GTT TTT ATG GAT AAT TAT TTA ACG CCT AAA ACT GCT 

L P R V I'QVPMDNYLT PKTA 
ACA ATG CAA GTG ATT CTA ACT TTT GCT GCT ATT TAT TTC TTT GGT GTC ATT GTC 

TMQVILTFAAIYPPGVIV 
AAA AGT ATT GTT TGG TTT TTC CAA TGG TAT TTG TAT TCG ATG GCT TCT TTA AAA 

KSIVWFFQ WYLYSMASLK 
ACG TAT CAA TAT ATA CGA GTG AAA TTA TTT GAA AAA TTA CAC ACT TTA GGT ATG 

TYQYIRVKLPEKLHTliGM 
CGT TAT TTT GAT CAA ACA CCA GCG GGT TCG ACG GTG TCT CGT GTT ACA AAT GAC 

RYFDQTPAGSTVSRVTND 
ACA GAA ACA TTG TTT GAA TTC TGG TAT GTA TTC CTA ATG GTG ATT ACA GGA ATT 

TETLFEFWYVPL MVITGI 
TTT GCC GTA ATC TCG TCA TTT TTT GCA ATG TTT CAA CTC AGT CCA GAA ATC TCA 

FAVI SSFFAMFQLSPB IS 
TTT TAT TGT CTA ATT TTC TTG CCG ATT TTA TTA GTT GTG ATT TGG TAT TAC CAA 

FYCLIFLPILLVVIWYYO 
AAA TTT AGT TCA AAA TTA TAT CGC AGT ATG CGG GAA AAA TTA AGT CAA TTA AAC 

KFSSKLYRSMREKIiSQLN 
ACC AAA CTA AAT GAA TAC ATC TCT GGT ATG CAA ATC ATT CAA CAA TTT CGC CAA 

TKLNEYISGMQI I QQFRQ 
GAA AAG CGT TTG GAA AAA GAA TTT GAG GAA ACC AAT GAT GAT TAT TTA AAA ACA 

EKRLEKBFEETNDDYLKT 
CGA GTA GCA ATG ATT CGT ATG AAT TCG TTA TTA TTA AGT CCC ATC ATT AAT TTG 

RVAMIRMNSLLLSPI INL 
TTG TAT ACG TTG GCG ATT GCA TTG GCC TTA ACT ATG TTT GGC ATC GAT GCT TTA 

LYTLAIAIjALTMFGIDAL 
CAT TCA CCT GTT GAA GCG GGG ATG ATT TAT GCT TTT GTA ACG TAT GTC CAA GCT 

HSPVEAGMI YAFVTYVQA 
TTT TTT AAT CCG ATG ACG CAA ATG ATG GAC TTC CTA AGT ATC TTT ACA GAT GGG 
FFNPMTQMMDFLSIFTDG 
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ATT GTC GCA GGT AGC CGT ATT TTA AAA ATT ATG GAT ACG GAA GAA TTA ACG CCG 

IVAGSRILKIMDTEELTP 
CAA CAA TCA GTA GGT GCG AAT GGG GAA ATT ATT CGC GGG AAA ATT GAA TTT CGC 

QQSVGANGEIIRG KIEFR 
AAT GTT ACT TTT TCT TAT GAC GGC AAA AAC GAA GTA TTG AAA AAT ATT AGC TTT 

NVTFSYDGKNEVLKNI SF 
GTC GCA AAT CCC GGG GAA ACC GTC GCT TTA GTC GGT CAT ACA GGA AGT GGT AAA 

VANPGETVALVGHTGSGK 
AGT TCA ATT ATT AAT GTA CTC ATG CGT TTC TAT GAA TTT TAT GAA GGG CAG ATT 

SB I INVLMRFYEFYEGQI 
TTA ATT GAT GAT CGT GAC ATT CGT GAT TTT CCC ATG ACA GAA TTA CGG GAA AAG 

LIDDRDIRDFPMTELREK 
ATG GGC TTA GTA TTA CAG GAC GCT TTC ATG TTT TAT GGA GAT ATT GCT GGA AAT 

MGLVLQDAFMFYGDIAGN 
ATC CGT TTA CTG AAT CCC AAT ATT ACA GAT GAG CAA ATT AAA CAG GCT GCG GAA 

IRI>LNPNI TDEQI KQAAE 
TTT GTT CAG GCA GAT AAA TTT ATT CAC ACG TTA CCC AAT ACC TAT CAT GCG AAA 

FVQADKFIHTIiPNTYHAK 
GTC ATT GAA CGA GGA GCG AGT TAT TCT AGC GGG CAA CGT CAA TTA ATT TCG TTT 

VlERGASYSSGQRQliISF 
GCA CGA ACG ATT GTG ACT GAT CCC AAA ATT TTA GTT TTA GAT GAA GCA ACC GCT 

ARTIVTDPKIIiVLDEATA 
AAT ATT GAT ACA GAA ACA GAA GGA TTG ATT CAA GAA GGA TTA GCC AAA ATG CGT 

N I DTETEGIilQEGLAKMR 
CAA GGG AGA ACG ACA ATT GCG ATT GCG CAT CGC CTT TCG ACG ATT CGT GAC GCA 

Q'GRTTIAIAHRLSTIRDA 
AAC TTG ATT TTA GTT TTA GAC AAA GGG CAG ATT GTA GAA CGA GGA ACG CAC GAA 

NLILVLDKGQIVERGTHE 
ACG TTA CTG GCA GAA GGT GGC CTT TAT GCG GAT ATG TAT CAG TTA CAA AGT ACG 
TIiLAEGGIiYADMYQ-LQSTJ 

GAA GTT TAA 3 ' 
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5 » ATG AGT ACT CGA CCA AAA GGC GGA CCA ATG GGC GGT GGA CCA GGA CGC AAC ATT 



M S 


S 


R 


P 


K 


G 


G 


P 


M 


G G 


G 


P 


G R 


N I 


GGC GCA AAA GGG 


CCA 


AAA 


CCA AAA AAT TTC TGG AAA ACA GTG AAA CGT 


TTA TTT 


G A 


K 


G 


P 


K 


P 


K 


N 


F 


W K 


T 


V 


K R 


I* F 


CGA TAC ATG 


TCT AAA 


CGA ATG 


CTG 


• TCA 


ATT 


ATT GCT 


GTC 


TTA 


GTA TTA 


GCA ATT 


R y 


M 


S 


K 


R 


M 


L 


S 


I 


I A 


V 


L 


V L 


A I 


GCA GCC GTC GTT TTC 


CAA ATT CAA ACA CCA AAA GTA TTA 


GGA CAG GCA ACA ACT 


A A 


V 


V 


P 


Q 


I 


0 


T 


P 


K V 


L 


G 


Q A 


T T 


GAA ATT 


TTT AAA 


GGC 


GTC ATG AAA GGC GCT GCT GAA ATG AAA 


CAA GGC 


TTA AAA 


E I 


F 


K 


G 


V 


M 


K 


G 


A 


A E 


M 


K 


Q G 


Ii K 


ATA ACA TCT 


TTT 


CCA ATT 


GAT TTT GAC AAA ATT GGT 


CAA ATT 


TTA TTA 


ATT GTT 


I T 


S 


P 


P 


I 


D 


F 


D 


K 


I G 


Q 


I 


L L 


I V 


ATT GCC 


ATG 


TAC 


CTT 


ATT 


TCT 


GCT 


GTC 


TTT 


AAT TTC 


TTG 


CAG 


CAA GTA 


ATT ATG 


I A 


M 


Y 


L 


I 


S 


A 


V 


F 


N F 


L 


Q 


Q V 


I M 


ACA CGT 


GTT 


TCA 


CAG 


CGA 


ACA 


GTT 


TAT 


GAA 


TTA CGT 


CAA 


GAA 


TTA GAA 


GCG AAA 


T R 


V 


S 


Q 


R 


T 


V 


Y 


E 


L» R 


Q 


E 


L E 


A K 


ATG AAC 


AAA 


GTT 


CCA ATT 


TCC 


TAT 


TAT 


GAT 


ACT CAT 


AGT 


AAT 


GGG GAC 


ATT ATG 


M N 


K 


V 


P 


I 


S 


Y 


Y 


D 


T H 


S 


N 


* G D 


I M 


TCA CGG 


GCA 


ATT 


AAC 


GAC 


ATG 


GAT 


AAT 


ATT 


GCG AGT 


ACT 


TTA 


CAG CAA AAC TTA 


S R 


A 


I 


N 

i 


D 


M 


D 


N 


I 


A S 


T 


L 


Q Q 


N L 


ACC CAA 


TTA 


ATT 


ACC 


ACT ATC 


GTG 


ACA 


TTT 


GTG GGT 


GTA 


CTT 


TGG ATG 


ATG CTG 


T Q 


L 


I 


T 


S 


I 


V 


T 


F 


V G 


V 


L 


W M 


M L 


ACC ATT 


AGC 


TGG 


CAG 


TTA ACA 


CTA 


ATT 


GCC 


TTA GCA ACA 


GTG 


CCA TTA 


AGT TTA 


T I 


S 


W 


Q 


L 


T 


L 


I 


A 


Ii A 


T 


V 


P L 


S L 


ATT GTT GTG ATG 


GTT GTG GCG 


CCT 


CGC 


TCG CAA AAA CAT 


TTT 


GCG GCT 


CAA CAA 


I V 


V 


M 


V 


V 


A 


P 


R 


S 


Q K 


H 


F 


A A 


Q Q 


AAA AGC TTA GGA TTA 


TTG 


AAT 


AAC 


CAA GTC GAA GAA ACC TAC 


GGT GGT 


CAC GTA 


K S 


L 


G 


L 


L 


N 


N 


Q 


V 


E E 


T 


Y 


G G 


H V 


GTA GTA AAA AGT 


TTC AAC 


CAC GAA GAA AGT GAT CAA GAA GTA 


TTT GAA AAA GAA 


V V 


K 


S 


F 


N 


H 


E 


E 


s 


D Q 


E 


V 


F E 


K E 


AAT GAA AAA 


TTA 


TAT 


CAT 


GCT 


GGT 


CGC AAA GCA CAA TTT 


ATT 


TCA GCG 


ATC ATT 



N'EKIi Y HAG R KAQ F I S A I I 



Fig. 7A 



WO 01/79257 



13/77 



PCT/USO 1/12230 



ATG CCT TTA ATG AAC TTT ATC AAA AAT CTA GGC TAC GTG TTT GTT GCA GTC CTT 
K V L M N P I KNLGYVFVAVL 

GGT GGC GTA AAA GTA GCG AAT GGT ATG ATG GAT TTA GGG GAT GTC CAA GCA TTT 
GGVKVANGMMDLGDVQAP 

CTT CAA TAT ACC AAT CAA TTT TCA CAA COG ATT ACT CAA ATC GCT AAT TTA ATG 

LQYTNQFSQPITQIANLM 
AAT ACA ATC CAA GCT ACQ GTG GCT TCG GCA GAA CGT GTC TTT GAA GTA TTA GAT 

NTIQATVASABRVFEVLD 
GAA GAA GAA ATG GTG GAT GAA CCT TCT GGC ATA CCA GTG GAA ACA GAT AGT CCT 

EEEMVDBPS GI PVETDSP 
TAT CGT GTT TCT TTT GAA CAT GTT GCT TTT GGC TAT TCA CCA GAG AAA TTA TTA 

yRVSFBHVAFGYSPEKLL 
ATG AAA GAT TTC AAT TTA AAT GTT AAA CCT GGG GAA ATG GTC GCA ATC GTG GGC 

MKDFNLNVKPGEMVAIVG 
CCA ACA GGT GCT GGG AAA ACA ACC CTA ATT AAC TTA TTG GAA CGC TTC TAT GAT 

PTGAGKTTIilNIiIiERFYD 
ATT AGT AGC GGC AGC ATC AAA TAT GAT GGC GTA GAT ACG CGC GAT TTA TCT CGC 

ISSGSIKYDGVDTRDLSR 
GAA GAG TTG CGA GCA CAC TTT TCA ATG GTT CTT CAA GAT ACT TGG TTG TTC ACT 

EEIiRAHFSMVLQDTWLFT 

GGA AGT ATC TAT GAC AAT ATT CAT TAT GOT AAT GAG CAA GCG AGC GAA GAA GAA 
GSIYDNIHYGNEQASEEE 

GTG ATC CGT GCG GCG AAA GCA GCC CAT GTG GAT GAT TTT GTC AGA AAA TTA CCA 
VlRAAKAAHVDDF VRKLP 
GAA GGC TAT CAA ACG ATT CTA AAT GAA GAA GCC AGC AAT ATT TCT CAA GGT CAA 

Vgyqtilneeasnisqgq 

CGA CAA TTA ATT ACA ATT GCT CGA GCA TTC TTA GCA AAT CCA GAC GTT TTA ATT 

RQLITIARAFLANPDVLI 
TTG GAT GAA GCC ACC TCA AGT GTG GAC ACT CGA ACA GAA ATA CTG ATT CAA GCA 

ldeatssvdtrteiliqa 

GCA ATG AAT CGT TTA TTG GAA AAT CGG ACC AGC TTT GTA GTC GCT CAT CGC TTG 
AMNRL1.ENRTSFVVAHRL 
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TCG ACT ATC CGT GAT GCC GAT ACG ATT ATC GTT ATG GCA GAA GGT TCA ATT GTA 
STIRDADTIIVMAEGS IV 

GAA ACA GGA ACC CAC GAT GAA TTA ATG GCG AAA AAC GGT TTT TAT GCT GAC TTA 
ETGTHDELMAKNGFYADL 

TAT AAC AGT CAG TTT TCA GAA GAA GTA GCC TAA 3» 
YNSQFSEEVA* 
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5 » ATG TCA TTA TGG TCG GTC TTT GCA GCG GTT GTA TTT ATG ATC GTA CAA ATT ATC 
MSLWSVFAAVVFMIVQII 
GCA GAT TTA TAT CTG CCC ACA TTA ACA TCT AAT ATC ATT AAT GAC GGT GTC GCT 

ADbYLPTLTSNIINDGVA 
AAA GGA GAT ATT AAC TAT ATT TGG AGT ACA GGA GTT GTC ATG TTA GGC TTT TCT 

KGDINY IWSTGVVMLGPS 
TTG ATT AGC ATT GTC GCT TCT ATT GGA AAT ACC TTT TTT GCC ACC AAA GAA TCA 
" LI S IVA SI GNTFFATKES 
CAA AGT TTA GGA AAA AAA TTA CGG ACT GAT ATT TAT CGA AAA GTT GAA AAT TTT 

QSLGKKLRTDI YRKVENF 
TCT AAT AAT GAA TTT GAT AAA TTT GGG ACA GCT TCG CTA ATT ACA CGA ACA ACC 

SNNEFD KPGTASLI TRTT 
AAT GAT GTC AAT CAG ATT CAA ATG GTT ATG CAA ATG TTT TTA CGT TTA ATG ATT 

NDVNQI QMVMQMFLRLMI 
AAC GCA CCT TTA ACG TTA ATT GGT GCA AGT TTT ATG GCG TAT AAC AAG GAC CCA 

NAPLTIi I GAS FMAY NK.DP 
CAA TTA ACG AAA ATT TTT CTA TAT GTT TTA CCA ATT ATG GCT GTT TTA GTG GGA 

QLTKI FLYVLPIMAVLVG 
GGT ATC ATG TTT TTA GCG GTT CCG TTA TTT AAA AGC ATG CAA AAG AAA ACA GAC 

GIMFLAVPLFKSMQ KKTD 
CGT TTA AAT TTA GTT TTT CGT GAA GGA CTG ACA GGA GTT CGT GTG ATT CGG GCC 

RLNLVFREGLTGVRVI RA 
TTT GGC AAA GCA AAC TAT GAA GAA CAA CGT TTT GAT GAA GCC AAT AAG GAT TAT 

FGKANYEEQRFDBANKDY 
ACA CAA ACG GCG ATT AAA GTG AAT ACG ATT GTC GCT TTG ATG ATT CCT TTA ATG 

TQTAIKVNTIVAI1MIPI1M 
ACA CTC ATT ATG AGT GGT ACC AAT ATT GCA ATC ACC TGG TTT GGC GGT CAT TAT 

TLIMSGTNIAITWFGGHY 
ATT GCG GAA ATG CAA CTA GAG GTA GGT AAC TTA ATT GCC TTT ATG ACT TAT GCC 

IAEMQLBVG.NLIAFMTYA 
ATG CAA ATT CTG ATG AGT TTC ATG ATG TTA TCC ATG ATT TTT GTC ATG GTA CCG 
MQIIiMSFMMLSMIFVMVP 
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5 » ATG TTA AAG CGT TTT TTT GGG TAT TAT CGG CCG TAT CGT CGA TTA TTT ATT TTG 
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AAA GTC AAC AAT ATG ATT GAA AGC TAT CCC AAA GGA TTT GCT GGC TTT AAA 'CGT 

KVNNMI ESYPKGFAGFKR 
TTT ACG GAA GAA ATG GAC AAA CAA CCG TCC ATC AAA GAT TTG CCA GGA GCT GTT 

FTEEMDKQPSIKDLPGAV 
GCT GTT TCA CAT TTA GAA GGA ACG ATT GCC TAT AAG GAT GTT TCA TTT GCT TAT 

AVSHLEGTIAYKDVSFAY 
GAA GAT GGC ACA AAA GTT TTA GAC CAT ATC AAT TTA AAA ATT CAA CCT GOT GAA 

EDGTKVLDHINLKIQPGE 
ACC GTT GCT TTC GTT GGA CAA ACT GGT TCA GGA AAA ACG ACC TTG TGC AAT CTG 

TVAFVGQSGSGKTTLCNL 
TTG CCT CGT TTT TAT GAA GTC ACT AGT GGT GAA ATT ACG ATT GAC GGT AGA AAT 

LPRFYEVSSGEITIDGRN 
ATC CAG AAA ATG ACC TTG GCT TCA TTA CGT AAG CAG ATT GGA ATT GTT CAA CAA 

IQKMTLASLRKQIGIVQQ 
GAT GTG TTT TTA TTC CCT GGA ACA TTA CGA GAA AAT ATT GCT TAT GGG AAT TTA 

DVFLFPGTLRENIAYGNIi 
AAT GCT ACA GAA ATA GAC ATT CAA CAA GCG GTG AAA TTA GCA CAT TTA GAA CAT 

NATE IDIQQAVKLAHIiEH 
GTA ATT CAG CTA ATG CCA GAT GGC TTG GAC ACA ATT ATT GGC GAA CGT GGC GTG 

VIQIiMPDGLDTIIGERGV 
AAA TTA TCG GGC GGT CAA AAG CAG CGA GTG GCA ATT GCG CGA ATG TTC TTG AAG 

KLSGGQKQRVAIARMFLK 
AAC CCG CCG ATT TTA ATT CTA GAT GAA GCG ACT TCT GCA TTA GAT ACA GAA ACA 

N P P I LI LDEATSALDTET 
GAG CAA GTG ATT CAA GAG TCA TTA AAC TCG TTA GCT GAC GGG CGA ACA ACA TTA 

BQVIQESLNSLADGRTTL 
ATT ATT GCC CAC AGA CTG GCA ACG ATT AAA CAT GCT GAT CGA ATA ATC GTA GTG 

I IAHRLATI KHADRI IVV 
AGT GAC CAA GGA ATT TTA GAA GAT GGA ACA CAT GAA ACT TTG TAC GCG CAA AGA 
SDQGILEDGTHETLYAQR 

GGC CAC TAT CGT CGC TTG TAC GAT GCT CAA TTT AGA ACA TAA 3 ' 
GHYRRLYDAQFRT* 
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5' ATG GAA AAA CAG CCG ACA ACG AAA GAC GTT TGG AAA AAC GAT CAA TGG GTT CGT 
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TTT TTT GAG 


GCA 


TTA AAA GAT AAA ACC GTC ATC 


TGG ATT ACC 


CAC 


CAT TTG CAA 


F F E 


A 


L K D K T V I 


WIT 


H 


H L Q 


GGC GTA ACT 


TTG ATG GAT CAA GTG ATC TTT ATT 


GAA GAT GGC 


CAA TTA GAG ATG 


G V T 


L 


M D Q V I F I 


EDO 


Q 


L E M 


AGT GGT ACG 


CCA GAA GAA TTA CTT GCA ACC AAT 


GCG CAT TAT 


CAG AAA CTT TAC 



SGTPEELLATNAHY QKLY 
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CGA ATT GAT CGT GGC ATT AGT TCT TTC GAG GAA TAA 3 1 
RIDRGISSFEE* 
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5 ' ATG AAA GAG TTT ATT AAA GAA AAT AAA TGG ATT GTT CTT GCG ACA ACT TTA ACC 
MKEFIKENKWIV LATTLT 
ATC TGT TTG CAA ATC GCA GGA ACC CTT GGC GTC CCT AAA TTA GTT GGC AAG TTG 

ICLQIAGTLGVPKLVGKL 
ATT GAT GTG GGC ATC GTT AGC GGT GAC CAA CAA GTA ATT AAM ACG ATT GGC ATA 

IDVGIVSGDQQVIXTIGI 
CAA ATG TTT CTT GTG GCA TTC ATT GGA ACA ATT GCC GCA ATT ATT TCT AGC TAT 

QMPLVAFIGTIAAI ISSY 
TTG TCT GCT TTA GTA GCT GCT AAA TTT GGT TTT CAA GTT AGA GGA TTG TTC TTT 

LSALVAAKFGPQVRGLPP 
AAA AAA TTT CAA CAA TTC TCG ATG AAA AAT GTT GAT AAA TTT GGT TCA AAT TCT 

KKFQQPS MKNVDKFGSNS 
TTG CTA ACT AGA ATG ACC AAC GAT GTA GAT AAT GTT CAA ACA ATG ATT GTG CTG 

LLTRMTNDVDNVQTMIVL 
TTT TGC CAA TTA ATC TTT CCA GCG CCT ATT ATT AGT TTA TTT GCC TTA GTG ATG 

FCQLIPPAPI ISIiFALVM 
ACA TTT TCT TAT TCA GTT TCA CTC GCT TGG GTA ACA TTG GCT TCC ATC OTA TTT 

TFSYSVSLAWVTLASIVF 
TAC TTA GTC GTT GTT TAT TTT TTA ATG AAA AAA GGT ACC CCT TTA TCT TTA AAA 

YLVVVYFLMKKGTPLSIjK 
ATT CAA CCA AAA ATG GAT CGA ATT ACT ACG ACT TTA CGA GAG TTC TTT ACT GGA 

IQPKMDRITTTLRBPPTG 
ATT AAT ATG ATT CGT GCG TTC AAT AAT CAA GAT TTT GAA GAG CAG CGA ACC AAT 

INMIRAFNNQDFEBQRTN 
CAA ACA TTT AAA AAT TAT GCT GAA CGC ATG AGT AAA GTG AAT CAA ATC TTT GCT 

QTFKNYAERMS KVNQI FA 
TGG ATT ACT CCC GTT GCC TTT TTA TTA ATG GGC GTT GTG TAC GCC TCT ATT TTG 

WITPVAFLLMGVVYASIL 
TGG TTT GGC GGT AAT TTA GTT GCA GTA GGC ACC CTA CAA ATT GGC ACC GTT ACA 

WFGGNLVAVGTLQI GTVT 
GCT GTG ATT GAA TAT ACG CTT CTA ACT TTG GCC TAC TTA ATG ATC GCG GCT ATG 
AVIEYTLLTLAYLMIAAM 
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GTA TTA GTT GTC ATT CCA CGG TCC GTT GCT TCC TTG AAT CGC TTG CAA GAA GTT 
"y" "l "v "v " V P R S V AS h N R Ii Q E V 
TTG TCA GAA GAA ATT GAA ATT AGC GAT CCT CAT ACT GAG GCA ACC ATT GCT TAT 
"b" S V "b I B I S D P H T E A T I AY 
CAT CCT GAG AAA GCC TTG ATT TGC TTC GAT CAC GTC ACG TTT CAA TAC ACA GAA 
T T "i" K "a" T I C P D H V T F Q Y T E 
ACA GCT GAT CCT GTT TTA GAA AAT GTT AGT TTT GTC ATT CCT AAA GGA AAA ACA 
"t" A "d "p V It E N V S F V I P K G K T 
ACG GCG ATT GTT GGT GCA ACT GGC GCT GGT AAA AGT ACT TTA GTT AAG TTA CTT 

TAI VGATGAGKSTLVKLL 
TTA CGA ATA AAT GAG GTC ACA GCC GGC ACG ATT AGC TAT TCT GGC ACA GAT ATC 

"l'rinevtagti SYSGTDI 

CGC TCA TTA TCT CAG CAA ACG ATT CGC CAA GTC ATC AGT TAT GTG CCA CAA AAA 
R ~ S b~ S Q Q T I R Q V I S Y V P Q K 

GCC TTT CTT TTC AGT GGG ACA ATC TTA TCA AAC TTA TTA ATG GGA AAT GCC AAA 
APLF SGTILSNLLMGNAK 

GCA ACT ACA GAA GAA ATA AGA ACG GCA CTA GAA ATT TCA CAA TCT TCT GAA TTT 

"a' ttbeirtaleisqssef 

ATC GAT TCC TTA CCA CAA GGG ATT GAA AGT TTC GTA GCA CAA GGC GGG TCC AAC 
"T D _ s" h' P Q G I E S F V A Q G G S N 
TAT TCT GGT GGT CAA AAA CAA AGA ATG TGT ATT GCA CGA GCC TTA ATC AAA CCG 
~Y~ ~S _ "g G Q K Q R M C I ARAL I KP 
GCA GAC GTT TAT ATT TTC GAT GAT AGC TTT TCC GCA TTA GAT TAC AAA ACT GAT 

A *D "v "y I F D D S F S A L D Y K T D 
GCC GCT CTA CGT GCC GCT TTA CAT GCA CAA ATG TCG GAC AAA ACT TTA CTC ATT 

a "a ~LRAALHAQMS DKT L L I 
GTT GCT CAA CGG TTA AGT ACA ATC ATG AAC GCT GAC AAC ATT ATC GTC CTA GAT 
"v" "a" Q R h S T I M K A D H I IVLD 
GAA GGA AGG ATT GTT GGT CAA GGC ACC CAC GCT GAT TTA CTT ACC ACT AAT AGC 
T "<T R "i" V g" Q G T H A D L L T T N S 
TAT TAC CAA GAC TTT GCT AAA TCG CAA GGT ATC TTA CCC AAG TAA 3 1 
YYODFAKSQGILPK* 
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5 1 ATG AAA AAT TTT TAT AAA AAA AAA TTT GCT TTA ACT GAT CAG GGC GCA GAA GCT 



MKNFYKKKF ALT 


D Q G A 


E A 


TTA ACT AAA GCT TCT ATT TCT AGT TTT TTT GTA TAC TGC ATT AAT ATG 


GTA CCA 


LTKASISSFPVY 


C I N M 


V P 


GCA TTC ATT ATA ATG ATG CTG ATA GAC GAA TTA GTT 


TTA GAA AAT GCA 


AAG CCC 


AFIIMMLIDBLV 


LENA 


K P 


CGT TGG CTT TAT TTT GCA GTA TCG TTT GTT ACT TTC CTT TTC ATG TAT TGG TTG 


R W LYFAVS FVTL 


L F M Y 


W L 


TTA GAT AGG GAG TAT GAA AAC TTA TAT AAT AGT ACT TAT AAA GAA AGT GCG CAT 


LDRBYBNLYNST 


Y K E S 


A H 


TTA AGA GTG CAA ATT GCA GAC GAT TTG TCA AAT TTA 


CCA TTA TCC TAT 


TTT TCA 


LRVQ IADDLSNL 


P L S Y 


F S 


AAA CAT AAT TTA TCA GAT TTA TCT CAA ACT ATC ATG 


TCT GAC GTT GAA 


GGT ATT 


KHNLSDLSQTIM 


S D V E 


G I 


GAG CAT GCG ATG AGT CAT GCA ATA CCT AAA TCC GGC 


GGT ATG GCT CTG 


TTT TTC 


EHAMSHAI P K S G 


G M A L 


P F 


CCT TTT ATT TCA GTG ATG CTT TTG GTT GGT AAT GTC 


AAA ATG GGA TTA 


GCT GTT 


PPISVMLLVGNV 


K M G L 


A V 


ATT TTG CCA ACG TTA TTT AGT TTT GTC TTA ATC TTG 


TTA TCA AAG AAA 


TCC CAA 


ILPTLFSFVLIL 


L S K K 


S Q 


ACG AAA GCC AAT ACT AAA TAT TAC GAT ACT TTG AGA GAA. AAC TCG GAA GAA TTT 


TKANTKYYDTLR 


E N S E 


E F 


CAA GAA ACT ATT GAA CTG CAG CAA GAG ATT AAT AGC TTT AAT CTA TCT AAA AAA 


QETIBLQQEINS 


F N L S 


K K 


GTT CAA GAC AGA CTT TTC AAA AAA ATG GAA GAG AGT GAA AGG ATT CAT TTA AAG 


VQDRLFKKMEBS 


B R I H 


L K 


GTA GAA TTA AGT ACT TTT TCA GTC ATG GCC TTA TCC TCT ATT TTC TCA TAT GTT 


VBLSTFSVMALS 


S I F S 


Y V 


AGT TTA GCA GTA GTC ATT CTA GTA GGT GTT CAC TTA CTG TTA ACG GGT GAA GTG 


SLAVVILVGVHL 


L L T G 


E V 


ACT ATA CTC TAC GTC GTT GGT TAC TTA CTA GCC GCA ATA AAA ATA AAG GAT TCC 


TI LYVVGYLLAA 


I K I K 


D S 
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TTC GAT TCT ATG AAA GAA GCA GTG CTT GAA ATA TTT TAT TTA GCT CCT AAA ATA 

FDSMKEAVLEIFYLAPKI 
CAG CGA ATT AGA GCT ATG AAG GAA ACC AGT ATC CAA GAG GGA TCA GAT AGT CCT 

QRI RAM KETSIQEGSDSP 
CTG AAA TCT TTT GAC GTT GAG TTG CGA GAT GTC TCT TTC TCA TAT GAT AAC AAT 

LKSPDVELRDVSFSYDNN 
ACC CCA ATT TTA GAT CAT ATC TCT TTT ACG GCT AAA CAA GGA GAA GTG ACT GCG 

TPI LDH I SPTAKQGEVTA 
TTA GTA GGT GCT AGT GGT TCC GGG AAA ACT AGT ATC TTG AAA TTA GTG TCT AGA 

XiVGASG S GKTSI L K I* V SR 
CTG TAT GAT TAT GAT GAA GGC TGT ATT TTA ATT GAT GGG TAT GAT ATT AAA CGT 

LYDYDBGCILIDGYD I KR 
GTA TCT CCA GCT TCT TTG TTT AGC AAA ATC GCT ATT GTT TTT CAA GAG GTA ACT 

VSPAS L FSKIAIVPQEVT 
TTG TTT AAT ACA AGT ATT TTG GAA AAT ATA CGT ATT GGA AAT AGT CAA GCG TCA 

LFNTSI LENIRIGNSQ AS 
GAT GAG GAA GTG AAA AAA GCA GCT CGT TTA GCT AAC TGC GAA GAC TTT ATT GAA 

DEEVKKAARLANCBDFIE 
AAA CTA CCT GAC GGT TAT CAT ACA TTG GTC GGA GAA AAC GGA AGT TCA CTT TCT 

KLPDGYHTLVGENGSSIjS 
GGA GGA GAG CGA CAA CGC TTA TCC ATA GCA CGG GCT TTT TTG AAA AAT GCG CCA 

GGERQRIiSIARAFIiKNAP 
ATT TTA ATT TTA GAT GAA ATT ACT GCC AGT TTA GAT GCG GAA AAT GAA AAA CGT 

ILILDEITASLDAENEKR 
ATT CAG GAA AGT TTA AAT CGT TTA ATA CAA GAT AAG ACA GTA CTA ATT ATA TCG 

IQESLNR LIQDKTVLI IS 
CAT CGC TTG AAG TCT ATT GAA AAG GTA AAT AAA ATT GTA GTG ATG GAT CAA GGA 

HRLKSI EKVNKIVVMDQG 
AAA GTA GTT GAT CAA GGT ACT CAT AGT GAG TTG TAT CGA AGG TCA GAA ATA TAC 

KVVDQGTHSELYRRSEIY 
AAA AAT CTC ATA AAG AAA ACT AAA CTA TCT GAG AAA TTC GTC TAT GAA AAA GAA 
KNLI KKTKLSBKFVYEKE 
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5 • ATG ACT AAA AAA ATA TGG TOG CAC TTT CGG AAA AGC TTC ACT TGG TAC ATT TTG 

M "t" k" K I W S H F R K S P T W Y I L 
ATT GGC GTG ATT GTT TCA TTA TGT AGT GCT TTA GCA ATT TAT TTT TTT CAA CAG 
"l""oVIV8LC8ALAIYFPQQ 
TTG TTA GAT CAT TAT CAA AAA AGT TTC CAA CTA GGG TTG TTA GTA GCG TAT GGA 

LLDHYQKSFQI*GL^ VAYG 
ACG ACA ATT ATT TTA ATT CCT CTA TTG TCT TAC TGT GAA CAG AAA CCG AAA GCT 

TTI ILIPLLSYCEQKPKA 
TAC TTA ACT AAT GGC ATC TAT TTC TAT TTG AAG AAA TTA AGT TTA ATA AAA ATG 

YLTNGIYFYLKKLSIjIKM 
AGC AAG ATT TCC TAT GAA GAA TAT CTA AAA CTG GGA GCG GGG GCT TTA CTA CAA 

SKI SYBBYIiKIiGAGALLQ 
AAA GTA GAA GTT GGT GCA GCA GCT GGG AGA AAT ATC CAC TTG AAT TTT TAT GGG 

"k'v evgaaagrnihlnfyg 

CGC TTA TTT CGA GAG TTA ATT CCA GAG ACG TTA TTT AAT TTA TTT TTC ATA GCA 
RLFRELIPETLPNIiPPIA 

TTG ATT GAT AAA AAA TTA TTA CCA GCA ATT CTA ATT GGT TAT GTG ATT GTA TTT 
LIDKKItlaPAlLIGYVIVF 

ATT TTA ACC AAA ATA CTT TTG AAA ACG CTA CAA AAA ATG AAA GAA AAA ACA CTT 
ILTKILLKTLQKMKBKTL 

ATT TCT GAA GAA GCA ATG AAT GCT ACT TTG ATA CGA GGG ATG ACC GAA CTA GTA 
ISEEAMNATLIRGMTELV 

ACT TTT AGA ATT AAT CGA AAA TAT AAA AAA GAG ATT GAA AAT TAC GCG TTA ATG 
TFRINRKYKKEIENYALM 

GCA GAG GAA AAT AGT CAA AAT ATA ACA AAA ATG ACT ATG ATC CAT GAA TTT TTC 
ABENSONITKMTMIHEFP 

TTT GGC TTT TTT GCC TTG TTA GTT GCA CTT ATC AAA GTA AGT ATT GTC GTA CTT 
FGPPALLVALIKVSIVVL 

AGT TTT ACT AAC GTC GTG ACG CTA AGT TTA GGT GGA CTT GTA- GCG ATA GTG ATG 
SFTNVVTLSLGGLVAIVM 

TAT ATC GAT CGA ATT TAT ACG CCG ATA GCC ATA TTT AAT GTT TTA TTT GTT CAA 
YI DRIYTPIAI FNVLFVQ 
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TAT AAT TTG GAT AAA GTA GCC TAT CAG CGC TTG GAA GAT TTT TAT AAA AAA GAA 



YNLDKVAYQRL 


E D F Y K K E 


GAT GAT CCT GAC TTA GOG GTT TCC GGC AAA GCG TTA CCT GAA ATT CAA ACA ATT 


DDPD LAVSGKA 


It P B I Q T I 


TCA TTA AAA GAT GTG TGT TTT AGC GTA GAT AGT CAA ACG ATT ATG TCC CAA CAA 


SLKDVCFSVDS 


Q T I M S Q Q 


AAT CGA CAA TTT TCA ATG AAT AAA ACC TAT GGC TTG ATT GGA AAA AGT GGG ACA 


NRQFSMMKTYG 


L I G K S G T 


GGT AAA TCT ACC CTG ATT AAA TTG ATT TTA GGA TTA TTG AAG CCG ACA GAG GGG 


GKS TLIKLILG 


L L K P T E G 


ACA GTT TAT GTG AAT CAA TTT CCT CTA ACA CAA TTT AAT TTG GAA GAT TAT TAT 


TVYVNQFPLTQ 


F N L E D Y Y 


GAA AAA GTT TTT TAT TTA TCT CAA GAT GTG CCA ATA TTT CAA GGA ACG TTA AAA 


EKVPYLSQDVP 


I F Q G T L K 


GAA AAT ATT GTA TTT AAT CAA GAA ATT TCT GAT GAA CAA GTA ATT GAA GCT ATG 


ENIVPNQEISD 


E Q V I E A M 


TAT CGT TTT CAG TTG GGG GAA CTT TAT GAA CGG 


TTG CCG GAA GGT TTA AAC ACT 


YRFQLGELYER 


L P E G L N T 


ATT GTG AGT GAA AAA GGA ATG AAT TTT TCT GGT 


GGT GAA AAA CAA CGG ATT GCT 


IVSEKGMNFSG 


G E K Q R I A 


TTC ACA CGC TTA GGA TTT ACA CAA GCA GAG ATC 


CTT ATT TTA GAT GAG GCG ACT 


FTRLAFTQAE I 


L I L D B A T 


TCT GCA TTG GAC GAA AAG ACC GAA GAA AAA GTG 


TTA CAA GAA GTA CAA AAA TTT 


SALDEKTEEKV 


L Q E V Q K F 


ACC CAC AAT AAG TTG ACT ATT TTG GTT ACT CAT 


CGA CCT AAA ACC CTG CGA TTT 



THNKLTI LVTHRPKTLRP 
GTA GAT GAA ATT ATT GAT TTA AAT GAG TGA 3 ' 



VDEIIDLNE* 
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5' ATG TCC ATA TTC AAA AAA TTA GGC TGG TTT TTT AAG CAA GAG AAA AAA AGT TAT 



M S I F K K 


L 


G W F F K Q 


E K 


K S Y 


ATT ATT GGG GTT TTC TCA TTA ATG ATG GTC GCT CTT GTT 


CAA TTA 


GTC COG CCC 


I I G V F S 


h 


M M V A X> V 


Q L 


V P P 


AAA GTT ATT GGC GTC GTT GTA GAT GAA ATC GTT AAC AAA 


GAA ATT 


CGC TTA ACG 


K V I G V V 


V 


D B I V N K 


E I 


R L T 


AAA ATT ATC GTG TGG GTT GCA CTC TTG ATT GGT GCT GGG 


CTT GCC 


CAA TAT CTT 


K I I V W V 


A 


L • L I G A G 


Ii A 


Q Y L 


TTT CGC TAT ATT TGG CGG ATG 


CAT ATT TGG GGG AGT GCG 


GCT CGT 


TTG GAA AAA 


F R Y I W R 


M 


H I H G S A 


A R 


Ii E K 


GAG CTA CGG ACT CAA TTA TTT CAT CAT TTC ACA AAA ATG 


GAT AGC 


ATC TTT TAT 


E L R T Q L 


F 


H H F T K M 


D S 


I F Y 


CAG AAA TAT CGG ACA GGT GAC 


TTG ATG GCG CAT GCA ACC 


AAT GAT 


TTA AAT GCC 


Q K Y R T G 


D 


L M A H A T 


N D 


It N A 


ATC CAA AAT GTT GCT GGA 


GCT 


GGG ATT TTA ACG TTT GCC 


GAC TCT 


GTG ATT ACG 


I Q N V A G 


A 


G I L T F A 


D S 


V I T 


GGA GGA ACA ACG ATT ATC 


GCA 


ATG GTT CTA TTT GTC GAT 


TGG CGC 


TTA ACA TTA 


G G T T I I 


A 


M V L F V D 


W R 


Ii T h 


ATT GCT TTA TTA CCG CTG 


CCT 


TTA TTA GCC GTC ACT TCA 


CGA GTT 


CTA GCT TCT 


I A Xi I> P L 


P 


L L A V T S 


R V 


L G S 


AAG TTG CAT GAT GCC TTT 


CGA 


GAT TCA CAA GCT GCT TTT 


TCA GCG 


ATT AAT GAT 


K L H D A F 


R 


D S Q A A F 


S A 


I N D 


AAA ACA CAA GAA AGT ATT 


ACT 


GGA ATT AAA GTC ATT AAA 


ACA TTT 


GGG CAA GAA 


K T Q E S I 


T 


G I K V I K 


T F 


G Q E 


AAA GAA GAT CTT GCT GAT TTT ACA GAG AAA ATT GAT GAT 


GCT ATC 


GTC AAA AAT 


K E D h A D 


F 


T E K I D D 


A I 


V K N 


AAA CGA ACG AAC TTT TTA GAT GCA CTA TTT GAT CCG TTT ATT ACG 


CTG ATT ATT 


K R T N F I* 


D 


A h F D P F 


I T 


Ii I I 


GGT GTT TCT TAT GTT TTG ACA ATT ATT ATT GGT GGT CGT TTC . ATC ATG GAA GGA 


G V S Y V Ii 


T 


I I I 0 0 R 


F I 


MEG 


ACG ATT AGT TTA GGT CAG TTG GTC TCT TTC ATT GCC TAT 


ATT GGG 


ATG CTA GTT 



TI SLGQLVSFIAYIGMLV 
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TGG CCA ATG TTT GCG ATT GGG CGC TTA TTT AAT GTG TTA GAA CGT GGG AAT GCT 
V P V "f " "a I " "o R L F N V L E R G N A 
AGT TAC GAT CGT GTG AAC GAA TTA TTA CAT GAA AAA ACG CAT ATT ATC GAA CGA 
"s" Y V r" V N E fa fa H E K T H I I E R 
AAA GAT GCC ATC AAA ACA ATG GCG CAA GGG ACA ATT TCA ATG AAG ATT GAT TCT 

"k "d a T "k"* "t" "m" aqgtismki ds 

TTT TCT TAT CCA AAA GAA GAG ACG GTG GCG TTG GAA AAT ATC CAA TTT TCG TTG 
"p" S~ V P K E K T V A L E N I Q F S L 
CAA GAA GGA GAA ACC TTG GGG ATT GTT GGC AAA ACA GGC GCT GGT AAA ACC ACT 

Q "e" g" e" t' I G I V G K T G A G K T T 
ATT TTG AAA TTG TTG ATG CGT GAA TAT GAC CAA TAC CAA GGA ACG ATT TCC TTT 
T "iT K _ 1' h M RE YD Q Y Q G T I S F 
GGA AAA CAT AAC ATT AAA AAT TAC ACA TTA GAT GCA TTG ATG GGT GCA ATG GGG 

"g'khni knytldalmgamg 

TAC GTG CCA CAG GAC CAT TTT CTC TTT TCC ATG ACG GTA CGA GAT AAC ATT CGT 

V V V Q D H F L F. S M T V R D N I R 
TTT GCT AAA CCA CAC TTG GAA CAA GCA GCA GTT GAA CAA GCG GCA GCA TTA GCA 

FAK PHLBQAAVEQAAALA 
TTT ATT AAC CAA GAA ATT AAA GCA TTC CCT GAA GGC TAT GAC ACA ATG GTT GGG 
T T "n" Q E I K A F P B G Y D T M V G 
GAA CGT G 3 1 
E R 
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5 1 ATG AAA AAG TAC GTG AGT TCG CTA AAA AGT TCG TTT CAT TTC GTG TAT AAT AAG 
MKKYVSSIiKSSFHPVYNK 
AGA AAG AAA AAA ATC CAT TTA ATT AAG GGA GTA ATT TTT GAA TTC TTG GTT CGC 

RKKKIHIiIKGVIPEFLVR 
TTA GGG TTT AAT AAT GGT AGA GAG AAA AGG TTG GAG GCA AAA GAA ATG AGC TTA 

LGFNNGREKRLBAKEMSL 
ACA ATT GAA CAT TTA ACA GGA GGT TAC GGC CAT ATT CCT GTC TTA AAA GAT ATT 

TIEHLTGGYGHIPVLKDI 
AAT TTT GAC GTC AAG TCT GGT GAA ATG GTT GGT TTA ATT GGC TTG AAT GGT GCG 

NFDVKSGEMVGLIGLNGA 
GGG AAA AGT ACC ACC ATC AAA AAT ATT ATT GGG TTA CTT ACQ CCG CAA AAA GGC 

GKSTTI KNI IGLIiTPQKG 
AAA ATT ATG ATT GAC GGG GAA ACA TTG CAA CAA GCC CCT GAA GAA TAT CGT AAA 

K I M I DGETLQQAPEEYRK 
AAA ATC GGG TAT ATT CCG GAA ACA CCT TCT TTA TAT GAA GAA TTA ACA TTG AAA 

KIGYI PETPSLYEBLTLK 
GAG CAT ATC GAA GTA ACQ GCC TTG GCT TAT GAT ATT CCA TTA GAA GAA GCG TTC 

EHIEVTALAYDIPIiEEAF 
AAA CGA GCA GAA CCA TTA CTA AAA ACG TTC CGT TTA GAC AAT AAA TTG GAA TGG 

KRAEPLLKTPRLDNKLEH 
TTT CCT GCT AAT TTT TCA AAA GGC ATG AAA CAA AAA GTC ATG GTA CTT TGT GCA 

FPANFS KGMKQKVMVLCA 
TTC TTA ATT GAA CCG AGT TTA TAT ATT ATT GAT GAA CCT TTT CTA GGC TTA GAT 

FLIEPSLYIIDEPFLGLD 
CCT TTA GCA ATT CAT GCT TTA TTA GAA TTA ATG GAT ACG ATG CGT AAG CAA GGG 

PLAI HALLELMDTMRKQG 
GCA GCG ATT TTA ATG TCC ACG CAT ATT TTA GCA ACA GCT GAA AAA TAT TGT GAT 
AAILMSTHI LATAEKYCD 
CGT TTT GTG GTG CTA CAC GAA GGG AAA TTA CGG GCA AAC GGT .ACA ATG GCT GAA 

RPVVIiHE GKLR ANGTMAB 
TTA CGT GCA GAA TTT AAT TTA CCA GAG TCT TCT TTA GAT GAT ATT TAT CTT GCC 
LRAEPNLPESSLDDIYLA 
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TTG ACG AAG GAA QAA AAG GTG GGG TAA 3' 
LTKEBKVG* 
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5 1 ATG AAA AAA TTA AGT ATT CAT CTG AAA GAC GTA ACT ATT CAC TTT TCG GGA AAA 

M KKLSIHLKDVS IHFS6K 
CCA ATT CTA GAA ATT GAT GAA TTA TTT GTA TAT GAA AAT GAA AAA ATT GGG ATT 

PILEIDBLFVYENEKIGI 
ATT GGT AAA AAC GGC GGA GGA AAA TCA ACA CTG CTC AAT TTA ATT ATG GGT AAG 

IGKNGAGKSTLLNLIMGK 
ATT CAA TCA GAT AAA GGA AAA GTT CAA AGA TTG AAC GAC TTT CAT TAG TTG GCA 

I QSDKGKVQRLNDFHYLA 
CAA GTA GCT GAA GAA ATA ACG AAT GAA TCA GAG AAA ACT GAC AAA AAT TGT TTG 

QVAE EI TNBSEKTDKNCL 
CTA AAT CAG AAA AAT CAA AAA CTA AGT GGT GGC GAA AAA GTC CAA AAA CGC TTA 

LNQKNQKLSGGEKVQKRL 
GCA ACA TTA TTT TCA GAG TAT CCA ACC GGG GTT ATT TTA GAT GAA CCA ACC ACA 

ATLFSEYPTGVI LDBPTT 
CAT TTA GAT AAA GAA CAT CGT CAG TTG TTA GTG GCA GAT TTA ACG TAT TAT TAT 

HLDKEHRQLLVADLTYYY 
GGG ACC GTT TTG TTT GTT AGT CAT GAT CGC TTT TTC TTG AAT CAA TTA GCA GAG 

GTVLFVSHD RFFLNQLAE 
AAA ATT TGG GAA GTT TCG GAT GGA CAT GTC AAA GAA TAT TTA GGG AAC TAG GAT 

KIWEVSDGHVKEYLGNYD 
GCG TAT TOT CGT CAA AAA GAA TTG GAA CAG CAA ACA CAA TAT AAT GTC TAT CAT 

AYCRQKELEQQTQYN VYH 
CAG TAT CAA AAG GAA AAG AAA AAT TAG AGG AAT CTT ACG CAA AGA AAC AAG CAC 

QYQKEKKNYRNLTQRNKH 
AAG CGC AAA AAT CTA GTC ATG TTT CAA AAA AAC AWA AWC AAA AGC ARA TTA AAC 

KRKNLVMFQKNXXKSXLN 

CTA GTC GTT TAG 3' 

L V V * 
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5» ATG ATT TTA TTA CAA GCA AAT CAA GTT GCC CGG CAT TTT GGC TCG GAA ACA TTG 

MILLQANQVARHPGSETI* 
TTT GAA AAC ATA CAT TTA GAA ATT GCA ACA AAA AGT CGG ATT GCC TTA GTT GOT 

PENIHLEIATKSRIALVG 
CGT AAT GGT GCT GGA AAA TCG ACT TTT TTA AAA ATC ATT GCA GGC ATT GAT GCT 

RNGAGKSTFLKIIAGIDA 
CCC GAT AGC GGA ACC ATT GCC AAA AAT AAA ACT GCT ACG TTA GGT TAT TTA GCT 

PDSGTIAKNKTATLGYLA 
CAA AAT ACC GGC TTA GAA TCA GAT AAA ACC GTT TGG GAA GAA ATG ACA AAA GCC 

QNTGLESDKTVWEEMTKA 
TTT GCT GAC ATC CTA GAA ATG GAA CAG CGT ATG CGA GAA TTA GAA ACT AAA ATT 

PAD ILEMEQRMRBLETKI 
AGT GAA ATG GAG CCA ACC ACT TCC GTT TAT GAA GGA ATT TTA AAA GAG TAC GAT 

SBMEPTTSVYEGILKEYD 
CAA TTG CAA CAT ACG TTT TCT GAA AAA AAT GGC TAC GGC TAT GAA AAT GAA ATT 

QLQHTFSBKNGYGYBNEI 
CGC TCA GTC CTT CAC GGC TTT GGC TTT GAT GAA TCC TTT TAC ACG AAA GAT ATT 

RSVLHGFGFDESFYTKDI 
CAA ACC TTA TCT GGT GGT CAA AAA ACC CGG CTT GCA TTA GCG AGA ATG CTT TTA 

QTLSGGQKTRLALARMIiL 
CAA AAA CCA GAC ATT TTA ATT CTG GAC GAG CCT ACA AAC CAC TTA GAT ATC GAG 

QKPDILILDEPTNHLDIB 
ACG CTT TCT TGG CTG GAA TCT TAT TTG CCA AGT TAT GCC GGC GCC CTA TTA ATT 
TLSWLESYLPSYAGALLI 

GTT TCC CAC GAT CGT TAT TTT TTA GAT AAG GTA GTT AAT GAA GTT TAT GAA CTG 
VSHDRYFIiDKVVNBVYEL 

AGT CGC AAA AAA ATG ACT CAC TAC AAA GGA AAC TAT TCC AAA TAC TTA GAG TTA 
SRKKMTHYKGNYSKYLEIi 

AAA GCA GAA CAA TTA GCC AGT GAA TGG AAA GCG TAT GAA AAG CAA CAA GAA GAA 

KAEQLAS ewkayekqqee 

ATC AAT AAG TTA GAA GAT TTC GTT GCC AAA AAT CTG GTT CGT GCA TCT ACA ACG 
INKLBDFVAKNLVRASTT 
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AAA CGT GCA CAA AGT CGC CGA AAA GTA TTA GAA AAA ATG GAC CGT TTA GAC CGA 



K R 


A Q S R 


RKVLEKMDRL 


D R 


CCT CAA GQA GAT GAA AAA TCG GCG CAT TTT CTT TTC GAT AGT GAA AAA GTC TCG 


P Q 


G D B K 


SAHFLFDSEK 


V S 


GGA AAT GTT GTT TTA CAA GTC GAA GAT GCC GCC ATT GGT TAC GAC CAA GAA CAT 


G N 


V V L Q 


VEDAAIGYDQ 


E H 


ATT TTA TCC GAA CCT ATT 


CAC TTG GAT ATT CGT CGC AAA GAA GCC ATT 


GCC TTA 


I Ij 


S B P I 


HLDIRRKEAI 


A L 


GTC GGA CCG AAC GGA ATT 


RRT AAA TCP ACT CTC TTG AAA TCA ATT ATT 


GAC CGC 


V G 


P N G I 


GKSTLIiKS I I 


D R 


ATT CCT 


TTC ATT AAA GGA 


fcfTT AAA IfT TTT RRP JxCC AAT (TTT TTT RTA 
Aul nrtn X xxx OOC nUv, nMl \3X X iLl win 


nnr tap 


I P 


P I K G 


SKTFGTNVSV 


G Y 


TAT GAC 


CAA GAG CAA GCC 


AAT TTA CAT GGC AAT AAA ACG GTC TTA GCG 


GAA TTA 


Y D 


Q E Q A 


NLHGNKTVLA 


E L 


TGG GAT 


GAA CAC CCA ACC 


ACA CCT GAA AAA GAG ATT CGA AGT ATT TTA 


GGC GGC 


W D 


E H P T 


TPBKEIRSIL 


G G 


TTT CTC 


TTC AGT GGA GAC 


GAT GTT GAA AAA ACG ATT CCT TTA TTA AGC 


GGT GGC 


P L 


F S G D 


DVEKTIPLLS 


G G 


GAA AAA 


GCC CGT GTG GCA 


TTA GCA AAA CTA GCG ATG GAT CGT GAC AAT 


TTC TTG 


B K 


A R V A 


LAKLAMDRDN 


F L 


ATT CTC GAT GAG CCA ACC 


AAT CAC TTG GAT ATC GAT AAT AAA GAA GTT 


TTA GAA 


I L 


D E P T 


NHLDIDNKEV 


L B 


AAT GCG CTG ATT GAT TAT GAA GGA ACC ATC CTC TTC GTT TCC CAT GAC COT TAC 


N A 


!■ I D Y 


EGTILFVSHD 


R Y 


TTT ATC AAT CGA ATT GCA ACA AAA GTT GTT GAG CTT TCT GAA AAA GGC 


AGC AAA 


F I 


N R I A 


TKVVELSEKG 


S K 


CTT TAT TTA GGC GAC TAT GAT TAT TAT TTA GAA AAG AAA CAA GAG GAA GAA GAA 


L Y 


L G D Y 


DYYLE KKQEE 


B B 


ATC GCT GCC CTC TTA GCT AAT GAA GAA GCG GCG AAA AAA CCC GAA CCA GTT ACA 


I A 


A I* L A 


NBBAAKKPEP 


V T 


GCC AAA AAT ACC TTT TAT 


CAA AAC AAG GAG CAA CAA AAA TTA CTC CGT ACT TTG 



AKNTFYQNKBQQKLLRTL 



Fig. 20B 



WO 01/79257 



PCT/USO 1/12230 



44/77 



CAA AGA AAA ATA ACA CAA GTC GAA GAA AAT CTT 6CT CAG TTA GAT ACG ACA ATT 

QRKITQVEENLAQLDTTI 
GCA CAA TTA GAA GCA CAA ATG AGT CAA CCA GAC ATT CTA GAA AAT CAT GTT GAA 

AQLEAQMSQPDILENHVB 
TTG CTG GCT TTA AAC CAA CAA TTA GAT GAA GTT CGT CAG CAA CAA GAT GAA CTA 

LLALNQQLDEVRQQQDEL 
CTT GAG CAA TGG GAA AAT TTC AGT TTA GAA TTA GAA GAA ATG GAA AAT AAC AAT 

LEQWENPSLBLBEMENNN 

TAA 3» 
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5 1 ATG AAT AAA CAA GAT GCT TGG ACA GCA GAT ACC AAT GCA AAA ATT ATT TTA CAA 

MNKQDAWTADTNAKI ILQ 
AAA TTA GGC ATT CCA ACA TTG GAA AAG AAA ATA GOT GAG CTT TCT GGT GGG CAG 

KLGlPTIiEKKI GBLSGGQ 
TTA AAA CGA GTA AGC TTG GCG CAA GTC TTA ATT GAA GOG CCT GAT TTA CTG TTA 

LKRVSLAQVLIEAPDLLL 
TTG GAT GAA CCA ACG AAC CAT CTG GAT TAT GAA ACG ATT GAA TGG CTG GAA AAC 

LDEPTNHLDYETIEWLBN 
TTT TTA AAT AAC TAT CGT GGC GCG GTT TTA ATG GTG ACC CAT GAC CGC TAG TTT 

PLNNYRGAVtiMVTHDRYP 
TTA GAT CGA GTA ACC AAT CGG ATT TTC GAA CTT TCT TTT GGA AAA CTG TAT GAA 

LDRVTNRI PELS FGKIiYE 
TAC AAA GGA AAC TAT GAA ACG TAT GTG ATG GAA AAA GCC GAA CGT GAA CGT GTC 

YKGNYBTYVMEKABRBRV 
GCT GTA GAA CAA GAA GAA AAA AGA AAA CGC CTC TTC AAA CAA GAA CTT GCG TGG 

AVBQEEKRKRLFKQBIiAW 
ATG CGA GCA GGC GTT CAA GCA CGA GGA ACG AAA CAA CAA GCA AGA ATT GAT CGG 

MRAGVQARGTKQQARIDR 
TTT CAG GAT TTA AAA GAA AAC TTG CAT CAG GTA CAA ACC AAT GGC ACC TTA GAA 

FQDLKENLHQVQTNGTIiB 
GCA GAC TTT GCC ACG CAG CGT TTA GGG AAA AAA GTT CTG GAA ATC AAG GAA GGA 

ADFATQRLGKKVLEIKEG 
AAT TAT GCC ATT GAT CAT AAA CAG CTT TTG AAA GAC TTT AAT TTA CTC ATT CAA 

NYAI DHKQ LL'KDFNLL I Q 
GCA AAC GAT CGA ATT GGC ATT ACT GGT AAA AAC GGT GCA GGC AAA TCC ACA TTA 
ANDRIGITGKNGAGKSTIi 

TTA AAT ATT TTA GCT GGA CGC ATT CCT TTA GAG AGT GGT CTA TAT AGT ATC GGT 
LNILAGRI PLBSGLYSIG 

GAA ACC GTC CGA ATT GGC TAT TAT ACC CAG CAA AAC GAA GAA ATG GAT CCT AAT 
ETVRIGYYTQQNBEMDPN 

CAA CGG ATG ATT GCT TAT TTG CAA GAA GCA GCA GAA GAA GTG AAA CGC AGT GAT 
QRNI AYLQBAAEEVKRSD 
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GGA AGC AGC GTT GGG GTC GCT GAA ATG TTA GAG CGC TTT TTA TTT CCT CGT TTT 
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5 ' ATG AGT ATT TTA ACT ATT GAA CAT TT A ACG CAT CGA TTT GGC GAA AAG GTC TTG 

MSILTI EHLTHRFGEKVL 
TAT GAA GAG GCT TCA TTG CAA GTG AAT AAA GGG GAT CAT TTA GGC TTA ACT GGC 

YEEASLQVNKGDHLGLTG 
CAA AAT GGG GTC GGC AAA TCC ACC TTA ATT AAA ATT TTA ACG GGG GAA GTA TTG 

QNGVGKSTLIKILTGEVL 
CCA GAC GAA GGA ACG ATT CAG TGG CAA AAA AAT TGC AAG ATT GGG TAC TTG GAT 

PDBGTIQW QKNCKI GYLD 
CAG CAC GTT TCT GTA GAA CAA TCA CTA ACG ATG GTT GAT TTT TTG AAA CAA GCC 

QHVSVBQSIiTMVDPIiKOA 
TTT CAA GAA CTT TTT GAT AAA GAA GCG AAA CTG ACA AAG CTT TAT GAA GAA TAC 

FQELFDKEAKLTKIiYBBY 
AGT CAA ACG GCT TCG GAA AAA CTT TTA GAA CAA GCA GGC AAG TTA CAA ACA GAT 

SQTASEKLLBQAGKLQTD 
TTA GAT GAA AGT AAT TTT TAC CAA ATC GAC ACG ATT ATT CAG GAT TTA GCC AAT 

LDESNFYQIDTI IQDLAN 
GGG TTA GGA CTA CAA GCA ATT GGT TTG GAT AAA AAG TTA GGG GAG CTA AGC GGT 

GLGLQAIGLDKKLGELSG 
GGT CAA CGT TCA AAA GTG ATT TTA GCA AAA TTA TTG TTA GAG GCC CCT GAT GTG 

GQRSKVILAKLIiLEAPDV 
TTA CTT TTA GAT GAA CCC ACC AAC TAT TTA GAT GAT ACA CAT ATT CAA TGG CTT 

LLLDEPTNYLDDTHIQWL 
GTT CGC TAT CTG AAT AAT TTT GAA GGA AGC TTT CTA TTA GTC TCC CAT GAT TAT 
VRYLNNFEGS PLLVSHDY 

CAA TTT TTA AAT GAA GTG ACG AAT TGC ATC GCA GAC ATT GAG TTT GGC AAG TTA 
QFLNEVTNCIADIBPGKL 

ACC AAA TAT ACT GGA AAT GTA GAA AAA TCT TTT GCA CAA AAA GAG CAG AAT AAA 
TKYTGNVBKSFAQKEQNK 

CAA ACC TAT TTG AAA CAG TAT CAG GCC CAA CAA GAA AAA ATT GAA AAA ATG GAA 
QTYLKQYQAQQEKIEKME 

GCC TAT ATT CGT AAA TAC AAA GCT GGA AAT CGA GCA ACG ATG GCT AAA AGT CGA 
A Y I RKYKAGNRATMAKSR 
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CAA AAA CAA TTG GAC CGG TTG GAA CGA TTG ACT CCG CCT GGT TCC TTG ACT AAG 

QKQIiDRLERLTPPGSLTK 
CCA GCG ATT GAA TTT CCT TAT CAA GGG TTA GTT GCA ACG CAA GCA CTA ACG ACC 

PAI EFPYQGLVATQALTT 
CAG AAG TTA GTT GTC GGC TAT CGG GAA CCT TTG TTA GAA CCG TTA GAT TTA ATG 

QKLVVGYREPLIiEPIiDIiM 
GTT CAT GTC GGT GAA AAA GTC GCA TTG AAA GGC TTT AAT GGG ATT GGC AAA TCA 

VHVGEKVALKGFNGI GKS 
ACA TTA ATT AAA ACG TTG ACG AAA GTG ATT CCT TCA TTA GAT GGA GAA TTT CAT 

Till KTLTKVI psldge ph 
TAT CCG CTG AAT ACA AAA ATT GCT TAT TTT ACC CAA GAC TTA GCG TGG CCT AAT 

YPIiNTKIAYFTQDLAWPN 
GAG CAG TTA ACG CCA CTA GAC TAT TTA TCA GAT CGT TTT CCA GAT ACA ACG ATT 

EQI»TPI*DYLSDRFPDTTI 
AAA GAG CGA AGA AGT CAT TTG GCT AGA GCA GGT TTA CCA GAT AAG TTA GCA ATG 

KERRSHLARAGLPDKIiAM 
CAG TCG CTA GCC CTG TTA AGT GGT GGT GAG CAA ACA AAA GTA AAA CTA GOG GAA 

QSIiALLS GGEQTKVKLAE 
CTA ATG ATG CAA ACA AGT AAT TTG TTA TTT TTA GAT GAG CCA ACC AAT CAT ATT 

LMMQTSNIiLFLDEPTNH I 
GAT GAA GCG GCC AAA AAA ATT TAC AAG AAG CAA TTC ACG TTT ACC CAG GAA CGG 

DBAAKKI YKKQFTFTQE R 
TTT TTC TGG WTT CCC ATG AAG CAG ATT TTT ATG AGG AAA TTG TGG ATC GAG TAA 3 1 
FFWXPMKQIFMRKLWIE* 
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5 ■ ATG ACT GAT GCT CTT GTG ATA CAA GAT TTA CGC AAA GTG TAC GCT TCA GGT GTT 

MTDALV I QDIiR KVYAS GV 
GAA GCG TTG CGG GGG ATT GAT TTA ACA GTC GAA GAA GGC GAT TTT TAT GCT CTT 

BALRGIDLTVE EGD PYAL 
TTG GGA CCG AAT GGC GCA GGA AAA TCA ACG ACG ATT GGG ATT GTG ACC TCG TTA 

LGPNGAGKSTT IGIVTSL 
GTT AAT AAG ACT TCG GGG AAA GTC AAA ATT TTT GGG TAT GAT TTA GAT ACG GAG 

VNKTSGKV KlFGYDIiDTE 
ATG GTG CGG GCG AAG CAA CAG ATT GGG TTA GTT CCG CAG GAG TTT AAC TTC AAT 

MVRAKQQIGLVPQEFNFN 
CCG TTT GAA ACG GTC CAG CAG ATT GTG GTC AAT CAA GCG GGC TAT TAT GGG GTG 

PFETVQQIVVNQAGYYGV 
TCT CGT AAA GAA GCC ATG AAA CGG ACT GAA AAG TAT TTA AAA CAG TCC AAT TTA 

SRKBAMKRSEKYLKQSNL 
TGG GAA AAA CGG AAC GAA CGG GCG CGA ATG CTT TCT GGC GGA ATG AAA CGT CGC 

WEKRNB RARMLSGGMKRR 
TTG ATG ATT GCT CGC GCG CTG ATG CAT GAG CCG AAG TTA TTG ATT TTA GAT GAA 

LMIARALMHEPKLLILDE 
CCA ACT GCA GGT GTC GAT ATT GAA TTG CGC CGT GAA ATG TGG GCC TTT TTA CAA 

PTAGVDI BIiRREMWAFIiQ 
GAG TTA AAT GCG CAG GGA ACC ACG ATT ATT TTG ACG ACC CAT TAC TTA GAA GAA 

EIiNAQGTTIILTTH YLEE 

GCT GAG ATG TTG TGT CGC AAC ATT GGG ATT ATT CAG TCT GGT GAA TTG ATT GAA 
AEMLCRNIGIIQSGELIE 

AAC ACC AGC ATG AAG CAC TTG TTA GCA AAA TTA CAA TTT GAA ACG TTT ATT TTT 
NTSMKHLIiAKIiQFETFI F 

GAT TTA GCA CCT TAT ACG CAG GCG CCA GTT ATT GAA GGC TAT CAA AGC GTA TTT 
DLAPYTQAPVIBGYQSVF 

GAG GAC GAG TTA ACG TTA GCG GTG GAA GTG GAA CGA AAC CAA GGA GTC AAT CAC 
EDBLTLAVEVERNQGVNH 

TTG TTT GAA CAG CTA AGT CAA CAA GGA ATC AAA GTA TTG TCA ATG CGG AAT AAG 
LFEQLSQQGIKVIiSMRNK 
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TCC AAC CGT TTG GAA GAG CTC TTC TTG AAG ATT ACC GAG GAT ACG TAC CAA AGG 
SNRLBELFLKITBDTYQR 

GAG GAT CAA CAT GTT TAG 3 1 
K D Q H V * 
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5' ATG GCA TTA ATT GAA TTA COT CAT GTA AAA AAA GAG TTT TCC GGT AAA GOG GOT 

MAL I ELRHVKKEFS GKAG 
AAA GTC ACG GCT TTA AAA GAT ATT GAT TTA ACA GTT GAA TCA GGT GAT ATT TAT 

KVTALKDIDLTVESGDIY 
GGC ATT ATT GGC TAT TCA GGT GCA GGA AAA AGT ACG TTG GTT CGC CTA, TTG AAT 

Gil GYSGAGKSTLVRLLN 
GGA CTT GAA ACA CCT ACA GAA GGT GAA GTA GAA ATT CAA GGA CAA GAT ATT GCA 

GLE T PTBG BVBI QGQDIA 
TTG TTG CCA AAT AAA GAA CTA CGA AAC TTC CGT AAA AAA ATT GGG ATG ATC TTT 

LLPNKELRNFRKKI GMIF 
CAA CAT TTT AAT TTA TTA TGG TCA CGG ACG GTC CTA GAA AAT ATC ATG CTC CCA 

QHFNLLWSRTVLENIMLP 
TTG GAA ATT GCT GGC GTT CCA AAA CAA AAT CGG AAA AGC CGC GCA GAA GAG TTA 

LEIAGVPKQNRKSRAEEL 
ATC AAG TTG GTT GGT TTA GAA GGA CGG GAA ACC GCC TAC CCA AGT CAA CTC TCT 

I KLV GLEGRETAY PSQLS 
GGT GGG CAA AAG CAA CGA GTG GGG ATT GCC CGT GCC TTA GCT AAC AAT CCT GAC 

GGQKQRVGIARALANNPD 
ATT TTG CTT TOT GAT GAA GCA ACG AGT GCT TTG GAT CCG CAG ACA ACT GAT GAA 

ILIjCDEATSALDPQTTDE 
GTG CTA GAA TTA CTA CTA AAA ATC AAC CAA GAA TTA AAT TTA ACG GTT GTA TTA 

VLELLLKINQELNIiTVVL 
ATT ACC CAC GAA ATG CAC GTG ATT GGT AAA ATT TGT AAT CGC GTA GCT GTG ATG 
ITHEMHVI RKICNRVAVM 

GAA TAT GGT GAA ATT GTT GAA GAA GGT AAA GTG ATC GAT ATT TTC AAA AAG CCT 
EYGE IVEBGKVIDI FKKP 

CAA ACA GAA ATT GCG AAA CGC TTT ATC CAA CAA GAA GCG GAT AAA AAC ATT GAA 
QTEIAKRFIQQBADKNIE 

GAA ACG GAA CTG GTT GTT GAA GAA ATG TTG GAA CAA TAT CCG AAT GGA AAA ATT 
ETELVVEBMLEQYPNGKI 

GTT CGC CTG CTT TTC CAC GGT GAA CAA GCG AAA TTG CCA ATT ATC TOG CAT ATT 
VRLLFHGEQAKLPI ISHI 
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GTC CAA GAA TAT CAA GTA GAA GTT AGT ATT ATT CAA GGG AAC ATT CAG CAA ACA 

VQEYQVEVSIIQGNIQQT 
AAG CAG GGT GCA GTG GGT TCT CTT TAT ATT CAA TTA TTA GGC GAA GAG CAA AAT 

KQGAVGSLYIQLLGBEQN 
ATT CTA GCA GCC ATT GAA GGA TTA CGT AAA CTT CGT GTA GAA ACA GAG GTG ATT 

I LA AIEGLRKLRVETEVI 
GGA AAT GAA TAA 3 » 

G N B * 
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5 ' ATG GOT CCT AAT GAG CGA CTA GCA ACG CTG AAA CAA AAT CAC TTT GAC TAC GAA 

MGPNERLATLKQNHFDY B 
GAC TAC ACT GTT TTA GAA ACT GTA ATT ATG GGA CAT AAA CGT CTT TAC GAA GTA 

DYTVLETVIMGHKRLYBV 
ATG AAA GAA AAA GAT GCT ATC TAT ATG AAA GAA GAT TTT TCA GAT GAA GAT GGG 

MKEKDAIYMKBDFSDEDG 
ATT CGT GCC GCA GAA CTA GAA GGC GAA TTT GCT GAA CTT GAC GOT TGG GAA GCA 

IRAABLEGBFAE LDGWEA 
GAA CCT GAA GCA GCT GTT TTA CTC CAA GGG CTA AAC ATT CCA GAA GAA TTA CAT 

EPEAAVLLQGLNIPEELH 
GAT CAA AAA ATG AGC GAA TTA ACA GCT GGT CAA AAA GTC AAA GTA TTA TTA GCT 

DQKMSBLTAGQKVKVLLA 
CAA TCT CTT TTT GGT AAA CCA GAT GTC TTA CTA TTA GAC GAG CCG ACA AAT GOT 

QSLFGKPDVLLIiDEPTNG 
TTA GAC ACC CGC TCA ATC AAT TGG TTA GAA GAA TTT TTA ATC AAC TTT GAA AAT 

LDTRSINWLBEFLINFEN 
ACC GTT ATC GTG GTT TCC CAT GAC CGT CAT TTC CTA AAC AAA GTC TGC ACT CAC 

TVIVVSHDRHFLNKVCTH 
ATG GCA GAT TTA GAC TTT ACT AAA ATC AAA CTT TAC GTT GGT AAC TAT GAT TTC 

MADLDFSKI KLYVGNYDF 
TGG TTG GAA TCA AGC CAA CTA GCG ACA AAA TTG CAA GCA CAA TCT AAT GCC AAA 

WLESSQLATKLQAQSNAK 
AAA GAA GAA CAA ATC AAA GAA TTA CAA GAC TTT ATC GCT CGT TTT AGC GCC AAT 

KEEQIKELQDFIARFSAN 
GCC TCA AAA TCA AAA CAA GCA ACG TCT CGT AAA AAA ATG TTA GAT AAA ATT ACC 

ASKS KQAT SRKKMLDKIT 
TTA GAT GAC ATT CAA CCT TCC TCT CGT CGT TAT CCA TTC GTT GGT TTT ACA CCA 
LDDIQP S9RRYPFVGFT.P 

GAA CGT GAA ATC GGC AAT GAC TTA TTA CAA GTT GAA AAT GTC TCT GTA ACG ATT 
EREIGNDLLQVENVSVTI 

GAT GGC AAA AAA ATC TTA GAT AAT ATC TCG TTC AAC TTA ACA AAA GAT GAT AAA 
DGKKILDN I SFNLTKDDK 
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GTG GCA TTC ATC GCT GAC TCA GAT ATT ACA ACQ ACT ACG TTA TTC AAA GTG ATT 

VAFIADSDITTTTLPKVI 
ATG GGC GAA ATT ACT CCC GAT ACA GGT TCT GTT CGT TGG GGC GTT ACA ACT AGC 

MGEITPDTGSVRWGVTTS 
CAA GCT TAT TTA CCA AAA GAC AAC AGC AAA GAC TTT GAA GAG CCA TTA ACC ATT 

QAYLPKDNSKDFEEPLTI 
TTA GAT TGG TTA COT CAA TTT GCT GGT AAA GAA GAA GAT GAC AAT ACG TTC TTA 

IiDWLRQFAGKBEDDNTFL 
CGT AGT TTC TTA GGT CGG ATG TTA TTC TCT GGT GAA GAG GTA CTA AAA CCA GTC 

RSFLGRMLFSGEEVLKPV 
AAT GTT CTT TCC GGA GGC GAA AAA GTG COT GTC ATG CTT TCA AAA TTA ATG CTT 

NVIiSGGEKVRVMLSKLMI* 
TCC AAA GCC AAT GTC TTA GTT TTA GAT GAT CCA ACG AAC CAC TTA GAC TTA GAA 

SKANVLVLDDPTNHLDLE 
TCA ATC ACT GCA TTA AAT GAT GGG TTG ATG GCT TTC ACT GGT TCA ATT CTT TTT 

SITALNDGLMAFTGS ILF 
GCT TCA CAT GAC CAC CAA TTT ATC CAA ACA TTA GCG AAC CGA ATT ATT GCT GTT 

ASHDHQFIQTLANRI IAV 
TCT GAT AAA GGG GTC ATT GAT CGG GCA GAA ACA ACC TAT GAT GAA TTC TTG GAA 

SDKGVIDRABTTYDE F L E 
AAC CCA GAA ATT CAA AAA CAA ATG GAC GTT TTA TTC AGT TCA GAT TAT TAA 3 1 

NPBIQKQMDVLFSSDY* 



Fig. 25B 
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5 1 ATG TCG AAA ATT GAA CTA AAA CAA CTA TCT TTT GCC TAT GAT AAT CAA GAA GOG 
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AAA CTA TTA AAA GAT CTT GAG TAT ATT GAT TCT TTG TCA ATG GAT TAT CAG CCA 
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5' ATG CCG TTT ATT CCA GTA TAT GTT GAA CAA TTA GGG ACG CCT AAG AGT CAA GTA 
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TTT TCA ACA GGT GCT TTA ATG CCA TCA ATT AAT ACQ TTA ATT AGT AAA ATA ACQ 
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5 • ATG GCT GTT TTT TTA TGG AGG AGG CAA ATT ATG ACX3 AAA AAA AAT AGT ATG ATG 
M " "a" V F LWRRQ I MTKKN SMM 
TAG TTA GCA ATT TCT AAC TTA TTT CTT GTT TTT CTA GGC GTA GGC CTA GTC ATT 

V "l" A I S N L P L V P L G V G L V I 
CCC GTA ATT CCC CAA TTA AAA GAA GAA ATG CAT TTT TCT GGT ACC ACA ATG GGA 
"p V I pQLKEBMHFSGTTMG 
ATG ATG ATT TCT ATT TTT GCG ATT GCC CAA TTA ATC ACA TCG CCA ATC GCA GGT 

M M I S I FAI AQL I T S P I AG 
GTC CTT TCG GAT AAA ATT GGT CGG AAA AAA ATG ATT GCA ACG GGC ATG TTG GTG 

V ~L "s" K I G R K K M I A T G M L V 
TTT TCA ATT TCT GAG TTA TTA TTT GGT TTA GCC CAA GCG AAA AGC GGT TTT TAT 
"p" "s X SELIiFGLAQAKS GFY 
ATT TCT CGT GGT TTA GGC GGG ATT GCC GCC GCT TTA TTA ATG CCG TCA GTG ACA 

V "s" "r" "q" "l" G G I A A A h L M P S V T 
GCC TTT GTG GCA GAT ATG ACC ACG ATT TCT GAA CGT CCG AAA GCG ATG GGG CTT 

A FVADMTTI SERPKAMGL 
GTG TCA GCC GCA ATT AGT GGT GGT TTT ATT ATC GGA CCA GGA GTT GCT GGT TTT 
"v' s AAISGGFIIGPGVGGF 

ATT GCT TAT TTA GGT ATT CGC GCC CCG TTT TTT GCA GCC GCA TTT TTA GCG TTT 

IA YLGIRAPFFAAAFLAF 
ATT GGT TTT ATT TTG ACA TTA ACT GTT TTG AAG GAG CCA GAG AAA CGA ATT TTA 

I 6 F I LTLTVLKEPEKRII* 
GCC GCT GTT GAA GCG AAA AAA GGT TCA TTT ATG GAT ATT TTA AGA AAT CCA ATG 
~A~~AVEAKKGSFMDI LRNPM 
TTT ACC TCA TTA TTT GTG ATT ATC TTA ATT TCC TCT TTT GGC CTG CAA GCG TTC 
FTSLFVIILISSFGLQAF 

GAA TCT ATT TAT AGT ATT ATG GCG ACT ATT AAT TTT GGC TTT ACC ACA AGT GAA 
"r" S I Y S I M A T I N F G F T T S E 
ATA GCA ATC GTG ATT ACG GTT AGT GGT ATT TTA GCG TTG ATT TGT CAG TTG TTT 

IAIV ITVSGIIiALICQLF 
TTC TTT GAT GCA ATC GTC CAA AAA ATA GGT GAA ATG GGT TTA ATC CAA TTA ACC 
V F~ D "a I V Q K I G B M G Ii I Q L T 
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TTT TTT GCA AGT GCC ATT TTT ATT GCC GTG ATT GCC TTT ACA AAA AAT AAT TTA 
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5 » ATG ATC GAT AGA AAA AAA GTT ATT TTG TAT ACT TOT TGC ATG AGT TTG TTT GTA 
~M~ I DRKKVII*YTCCMSbFV 
GTG ACC ATG GAT GTT ACG GTT GTC AAT GTG GCG TTA CCA TCC ATT CAA AGT GAT 

VTMDVTVVNVALPS I QSD 
TTT CAC ACG AAT CTG TCT ACA TTA CAG TGG GTA ACA GAT GGC TAC. ACT TTA ATG 

FHTNLSTLQWVTDGYTIiM 
GTA GCA TCC TTA TTA TTG TTG TCT GGG TCT ACA GCA GAT CGA ATT GGC CGT AAA 

VAS LL LL S GS TAD R I GRK 
CGA GTC CTT CAA TTG GGC TTA GCC TGT TTT GGT TTA GCG TCT TTC CTA TGT GGG 

RVLQLGLACFGLASFIiCG 
ATT TCG CAA ACC CCA GGG CAA TTG ATC GCG TTT CGC ATG TTG CAA GGG ATT GGT 

ISQTPGQLIAFRMIiQGIG 
GGT TCT ATG TTA AAT CCG GTA GCA ATG TCT ATT ATT ACA CAG GTA TTT ACC GAA 

GSMLNPVAMSIITQVFTB 
AAG TTA GAA AGA GCG AAA GCA ATT GGC TTG TGG GGC TCT GTT ACA GGG ATT TCC 

KLBRAKAI GLWGSVTGIS 
TTA GGC ATG GGC CCG ATT ATT GGT GGA CTG ATT GTT TCT TAT TTT AGC TGG CGG 

LGMGPIIGGLIVSYFSWR 
TAT GTC TTT TTT GTA AAT GTA CCG ATC ATT GCT GCG GCA ATC ATC CTT ACA CAA 

YVF FVNVPI IAAAI I LTQ 
AAG TTT GTA CCT GAG TCA AAA GTA GAG AAG ACG GTG AAA AAT GAT TTT GTT GGT 

KFVPBSKVEKTVKNDFVG 
CAA GCA TTG ATG ATT CTT TTT CTA TTT AGT TCT ATC TAT TCC ATT ATC GGA CTA 

QALMILFIiFSSIYSIIGL 
CCT AGA AAA GGG CTT TTC GCG CCA GAT ATT TTA AGT ACT GGG ATA ATC GGC TGC 

PRKGLFAPDILSTGI IGC 
TTA GCC ATT GTT ATT TTC TTT ATT TAT GAA TAT AAC ATT GAC AAT CCG TTA ATC 

L A I V I PFIYEYNIDNPLI 
AAC CCG CGT TTC TTT TTA TCT ATT CCA TTT ACA TCG GCT TCT TTT TTA GCT ATT 

NPRFFLSI PFTSASFLAI 
TTT GGC TTT ATC ATA TAT AAC GGC TAT TTA TTT TTA AAC ACG CTA TAT TTG CAA 
FGFIIYNGYLFbNTLYLQ 
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GAG ATG AGA GGC TTC AGC CCG TTG GAA GCT GGC TTA TCA ACC ATT CCT TTG GCT 
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5' ATG GCA AAA GAA ACA AAT GTT AAG TTA GTC ACG GTG AGT GTT TTT GTG GCA ACA 
MAKBTNVKLVTVSVPVAT 
TTT ATG ACA GCC ATT GAA GGG ACC ATT GTG TCT ACT GCG ATG CCA ACG ATT GTC 

FMTAI EGTIVSTAMPTIV 
GGC TCG TTA CAT GGC ATG GAA ATT ATG AAC TGG GTA TTT TCA ATT TAT TTA TTA 

GSLHGMEIMNWVFSIYLL 
ACG AAT GCG ATG TTA ACA CCG ATT TAT GGG AAA CTT GCG GAT AAA ATT GGT CGT 

TNAMLTP I YGKLADKI GR 
AAA CCT GTC TTC ATG ATT GGC ATT ATC ATT TTT ATT TTG GGC TCC TCG TTG TGT 

KPVPMIGIIIFILGSSLC 
GGC TTT GCT CAA GAT ATG TTG ACT TTA ATT ATT GCC CGC GCA ATT CAA GGT GTG 

GFAQDMLTLI IARAI QGV 
GGG GCA GGC GCA ATT TTA CCA GTT GCG TTA ACG ATT ATT GCC GAT ATG TAT ACA 

GAGAI LPVALTI IADMYT 
TTG GAC AAG CGA GCG AAA ATT TTA GGT TTA AAC AGT GCC GCC TGG GGA ATT GCT 

LDKRAKI LGLNSAAWGIA 
AGT ATT TTT GGT CCG TTA GCA GGT GGT TTT ATT GTA GAT ACA GTC GGT TGG CAT 

SIFGPLAGGPIVDTVGWH 
TGG ATT TTC TTC ATT AAT GTT CCT ATT GGA CTT GTT TTA TTG GGC TTG ATT AGT 

WIPFINVPIGLVLIiGIjIS 
ATT TTC TTA GTT GAA CCA AAG CGG GAA CGG ACC AAG ATG CCA ATG GAT ATT TTG 

IFLVEPKRERTKMPMDIL 
GGC AGC GTT ACT TTG ATG GTA GTG CTG CTA ACG TTA TTG CTA GGT TTT CAA ATG 

GSVT LMVVLLTLLLGFQM 
ATT AGC GAT AAT GGT TTT ACA TTA GTA ACA TTT GGT TCT TTA AGT TTG AGT GTG 

ISDNGFTLVTFGCLSltSV 
CTC TTT TTT GTA GCA TTT GTG ATG ATA GAA AAA CGC GCG CAA GAC CCA GTG ATT 

LFFVAFVMIBKRAQDPVI 
GAT TTG CAT TTA TTT AAT CAA CCA ACG TTT GTT TTA GTA AAT CTT ATT GCA GCG 

DLHLFNQPTFVLVNL IAA 
CTT ATT AGC GGT TTC TTA ATG GGG ATT GAT GTC TAC ATT COG ATG TGG ATG CAA 
LISGFLMGIDVYI PMWMQ- 
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GOT GTC TTA GGA AAA ACT GCA GGA ATT GGT GGC TTA GTT TTA GCG CCT ATG TCG 



G.VLGKSAGIGGLV 


L A P M S 


TTA CTT TGG ATG GCT GGA TCA TTT ATA GCA AGT AGT TTT ATG GAA AAA TAT GCT 


LLWMAGSFIASSF 


M E K Y A 


ATG AAA AAA GTC TTA ACG ATT GGG TTA TCG ATT CTA TTA 


GTC GGG GCC ATC TTT 


MKKVLTIGLSILL 


V G A I F 


TTA GTG GTA ATG CCA ATG GCC GTT CCG TTT TGG CTT TTC 


TTT GTA GTG TCT TCT 


LVVMPMAVPPWLF 


F V V S S 


GTC TTA GGA GTT GGT TTT GGG ATT. ACT ATC ACA ACG ACT 


ACG GTG ACA GCA CAA 


VLGVGFGITITTT 


T V T A Q 


AGT ACA GTG GAG CCT GAA AAA ATG GGG GTT GCA ACA TCG 


TTT AAT ACG TTG GTG 


STVEPE KMGVATS 


F N T L V 


CGT ACA ATT GGG CAG ACT GTG ATG GTG TCA ATT TTT GGT 


GTG ATT TTA AAT GCA 


RTIGQTVMVSIFG 


V I L N A 


GGA ATG TTT GCG AAA TTG GAA GCG AGC GCG TTA AAC GTC 


GAT GCA GAT GTC ATG 


GMFAKLEASALNV 


D A D V M 


AAT CAA GTA GTG AAT CCA CAT ACT GCA AAT TTA ATT CCA 


GCT GCG TTG TTA AAA 


NQVVNP HTANLIP 


A A L L K 


CCA TTA CGC GGT ATC CTC TAT GCA GGT CTG CAT AAT GTT 


TAC TTA GTC GGT GCG 


PLR-G1 LYAGLHNV 


Y L V G A 


GGC TTA GTT GTT GTC GCT CTT TTG TTA AAT ATT TTC GCA AAA GCG CAA CGA GCG 


GLVVVALLLNI FA 


K A Q R A 



AAG GTT TAG 3' 



K V * 



Fig. 30B 



WO 01/79257 



PCT/US01/12230 



65/77 



5 1 ATG GCT GGT ATT GGC TTT AGC TTA GTG ATG CCA TTT ATQ CCT CTT TAT ATC AAC 

MAGIGFSLVMPFMPLYIN 
ACA TTA GGT ACT TTT ACC CAC CAA CAA TTG AAT TTT TGG AGC GGA ATC ACG TTC 

TLGTFTHQQLNFWSGITP 
TCT TCC ACT TTT TTA GTC ACA ACC ATT GTT TCA CCT TGG TGG GGC CGT TTA GCC 

SSTFLVTTIVSPWWGRLA 
GAC CGA AAA GGA CGC AAG TTA ATG TTA TTA CGC GCT TCA TTA GGG ATG GCT ATC 

DRKGRKLMLLRASLGMAI 
GTC ATT AGT TTA ATG GGC GCA GTC ACA AGT GTT TAT CAA TTG ATT GGC TTA CGC 

VISLMGAVT .SVYQLIGLR 
TTA TTA CAA GGT GTT TTT TCT GGT TAT ATC AGT AAT GCG ACT GCG TTA GTT GCT 

LLQ GVFSGYI SNATALVA 
ACA GGC ACA CCT AAG GAA AAA AGC GGT CAA GTG CTT GGT ACT TTA GCC ACT GGT 

TGTPKEKSGQVLGTLATG 
TCT GTT ACC GGC ACG TTG CTT GGT CCT CTT TTG GGC GGT GTC ACT GCG TCT ATT 

SVTGTLLGPLLGGVTAS I 
TTC GGT TAT CGG CCA ACC TTT TTC ATT ACT GGT ACG ATT TTA TTA CTC GTT TTT 

FGYRPTFFITGTILLLVF 
GTT TTG AGT CTC GTC TTT GTC CAT GAA GAG TTT GTG CCA ATT GAA AAA AAT CAA 

VLSLVFVHEEFVPIEKNQ 
GCC GCA TCT GGG AAA CAA ATT TTA AAA AAA CTA GAA CAT CCC CAC GTG ATT CTT 

AASGKQIIiKKLEHPHVI L 
GGA ATG TTT ATT ACG ACA TTA ATC ATT CAA GCT TCC AAT AAT TCA ATT AGT CCC 

GMFITTLIIQASNNSISP 
ATC ATC AGT TTA TAT ATT CAA CAA TTG TTG GGT GGT CAC GGA AAT GTC ACC TTA 

I I S LY I QQLLGGHGNVTL 
ATT AGC GGA GTC ATT GCT TCT ATT CCA GGG ATT GCA ACG TTA ATA GCC GCC CCT 
ISGVIASIPGIATLIAAP 

CGT TTC GGC CGG TTA GGC GAT CGT ATT GGC AGT GAA CGC ATC TTG ACA ATT GGC 
RFGRIiGDRIGSERIIiTIG 

TTA ATT TTA GCC ATT TTT GTT TAC CTA CCA ATG GCC TTT GTT CAA AAT GTC TGG 
L I L AI FVY LP MA FVQ NVW 
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CAG CTT GCC ATG CTT CGT TTC TTA GTT GGG ATT TCC GAT GCC TGT TTA CTA CCA 

QLAMLR PLVGISDACLLP 
GCT GTT CAA ACG TTG ATC ACT CGT TAT TCC CCA AGT GAC GCT GCA GGT CGT ATT 

AVQTLI TRYSPSDAAGRI 
TTC AGT TAG AAT CAA TCT TTC CAA GCG ACA GGA AAT GTG ATT GGT CCT ATG ATT 

FSYNQS PQATGNVIG PMI 
GGT TCA AGC GTA TCT GCC GCT TTC GGT TAT CGT GGG GTC TTT ATT TCC ACT TCT 

GSSVSAAFGYRGVPISTS 
TGT CTG GTT CTC CTT AAC CTT CTA TGG GTT CGT CGA AGC ACC GCC GAA TTA AAA 

CLVLLNIiLW.VRRSTAELK 
AAA GAG AAA AAT GAT GAC TAA 3 • 

K E K N D D * 




WO 01/79257 PCT/US01/12230 . . 

67/77 

5' ATG CAA TTT AAT TTG AAA CTT ACA QAA AAA GGA GCG ATA TAT GTG ACT TCA CAA 

MQFNLKLTEKGAI YVTSQ 
CAA CCT GTG GAT ATT CAC GGA AAA CCT TAT AAC CGG TCG TTA CTA GTC GGT GTT 

QPVDIHGKPYNRSLLVGV 
TTA CTA ATC GGA ACA TTT TGT ACG ATT TTA AAC CAA ACG CTT TTA ACG ACQ GCC 

LLIGTPCTILNQTLLTTA 
TTA CCT ACA TTA ATG AAA GAA TTT GAT ATC TCA GCT TCA AGT GTA CAG TGG CTA 

LPTLMKEFDISASSVQWL 
ACC ACT GGA TTC CTA CTT GTG AAC GGC ATT ATG ATT CCC ATC AGT GCT TGG TTG 

TTGFLLVNGIMIPISAWL 
ATT AAC AAA TTC AGT TCA AAA AAA TTA TAC ATA ACA GCG ATG TCA ACG TTC TTA 

INKFSSKKLYITAMSTFL 
ATC GGA ACC ATC ATC TGT TTT GTA GCG CAA GAC TTC GGC ATG TTA TTG ACT GGA 

I G T I I CFVAQDFGMLLTG 
CGT CTT GTT CAA GCC GCA GGG GTC GGT GTA TCT ATG CCT CTT TTA CAA ACA ATT 

R I* V. QAAGVGVSMPIiIiQTI 
ATG CTT TCA ATT TTC CCG CCT GAA AAA CGT GGC GCT GCC ATG GGC ACA ACG GGA 

MLSIFPPEKRGAAMGTTG 
ATT GTT ATC GGA TTA GCA CCC GCT TTA GCT CCA ACA TTA TCT GGT TGG ATT ATT 
IVIGLAPALGPTLSGWII 

GAT TCT TAT ACA TGG CGT GAC CTT TTT GGG ATG GTC ATT CCA ATT GTC GTT TTA 
DSY TWRDLFGMVIPIVVL 

GTC CTA ATT TTA GCT TCA TTT TTA ATG AAA AAT GTC ATT CAA TTA TCT AAT CCA 
VLILASFLMKNVIQLSNP 

AGT ATC GAT GTC TTA TCG GTC ATT CTT TCT ACT CTC GGC TTT GGT AGC TTA CTT 

SIDVLSVILSTLGFGSLL 
TAC GGT TTC TCA AGC GTT GGT GAT AAA GGT TGG GGT AGC CCG CAA GTC TAT GGA 

YGFSSVGDKGWGSPQVYG 
TTC TTA ATT GTT GGT GCG ATT GTT TTA TGT CTT TTT ACA TAT CGA CAA TTG CAT 

PLIVGAIVLCLFTYRQLH 
TTA GAA CAA CCT TTC TTA GAA CTA CGC GTT TTT AAA TCA AAA GTC TTT ACT GTT 

LEQPFLELRVFKSKVFTV 
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5' ATG GAG AAA ATT AAA AGT TAC CAA GAG GAT TCP GAA GTG CAG AAA AAT CGT TGG 

MEKIKSYQBDSBVQKNRW 
TGG ATC TTA GTT TCT GTG GCT ATG TTT ACC TTC ATG TCT ACA TTA GAT TCC AGC 

WILVSVAMFTPMSTLDSS 
ATT GTA AAT ATT GCT TTA CCG ACT ATT TCA AAA GAA ATG TCC GTA CCC ATG AAT 

IVNIALPTISKEMSVPMN 
CAA GCG GAA TGG GTC GTA TCT ATT TAT TTA ATG GTT GTC TGT GCT TGT TTA TTA 

QAEWVVS I YIiMVVCACLL 
TTA TTT GGA AAA ATT GGC GAC AGT TTT GGA AAA ATA AAA GTT TAT CGA ATT GGC 

LFGKI GDSPGKI KVYRIG 
ACG GTT ATT TTT ACG ATT GGT TCG TTA CTT TGT GGC TTT AAT CAA TCG CTG AGC 

TVIFTIGSLLCGFNQSLS 
TTT CTA TTA TTT GCG CGA GTC GTT CAA AGT ATT GGT GCT AGC ATG ACC ATG GCG 

FLLFARVVQSIGASMTMA 
ACC AAT TCA GGA ATT ATT ACG GAA GTA TTC CCT TTA AAT GAA AGA GGG CGG GCG 

TNSGI ITEVFPLNBRGRA 
CTT GGT GCA GTA GGT GCC TTT GTC TCT TTA RGC GCG ATT GCT GGT CCT GGG ATT 

LGAVGAFVSLXAIAGPGI 
GGT GGA TTG ATC TTA TCA AAC TTT TCT TGG TCG TAT ATT TTT TGG ATT AAT GTT 

GGLILSNFSWSYIFWINV 
CCC GTG GGG CTA GTT ACA ATT TTA ATT GGC GAA AAG TTC CTA CCA AAA GAT ATT 

PVGLVTII.IGEKFr.PKDI 
ACA AAA ACA AAA GAA AAA ATC GAT TTT TCA GGA TTT GCT TGT ATT GCC ATT GCG 

TKTKEKIDFSGFACIAIA 
ATT ATG ACG TTC TTC GGC GGG ATT TTC CTC GGA CAA QPfi AGC GGG TTT GGC TCG 
IMTFFGGI FLGQESGFGS 

TTA CAA TCC TAT TTA TTG TTC ATT ATT GCA GTA ATC GCA TTA GGA TTA TTT ATA 
LQSYLLFI IAVIALGLFI 

ATG GTT GAA CGA AAA AGA AAA AGC CCA CTT ATT AAG TTC GCA ATT TTT AAA AAC 
MVERKRKSPLIKFAIFK.N 

AAA ATT TTT ACT TTA AGT TTA TTG TCC GCC GTT TTA ATT TTT GCC TCA AAC TTT 
KI FT L S LLS AVL I FAS N F 
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TTT GTA AAT GTT GTC ATT CCA TTT TAT TTG CAQ GAT GCA CGC AAA ,£TA*_jAGT OCT 

FVNVVI PPYLQDARKLSA 
AGT TAT GCT GGT CTA TTA ATG ATG GTA TTT CCG TTA TTA ATG GTG GTT GGG GCT 

SYAGLLMMVFPIiIiMVVGA 
CCT CTG AGT GGC TAT TTG ACG GAT AAA ATT GGC CCA GGT ATT TTA ACA TTT GGC 

PLSGYLTDKIGPGILTFG 
GGA TTG TTG CTC TTG TGC TGT ACG TCG TTA ATG TAT ATG TTT TTA GAT ATG AAT 

GLLliLCCTSLMYMPIiDMN 
TCG CCT ATC TGG TAT TAT GTG ATT GCA ACG GCC ATT ATG GGC TTG GGA AAT GCA 

SPIWYYVIATAIMGLGNA 
CTT TTC CAG TCT CCA AAC AAT ACA ATG GTT ATG AGC AGT GTT GAA AAG- CAA GAT 

LFQSPNNTMVMSSVEKQD 
TTA GGT GTA GCA GGA AGT ATG AAT TCT TTT GCT AGA AAC TTA GGA ATG GTC ATT 

LGVAGSMNSFARNLGMVI 
GGG ATT GCG TTG TCA ACG ACC ATT TTA TAT CGT GGC ATG AGC GAA GCC TAT GGT 

GIAIiSTTILYRGMSEAYG 
GAA CGA GTA ACC ACG TAT CTG GCT AAT CGC CCA GAT ATA TTT ATT GTG GGA ATG 

brvttylanrpdipivgm 
cgt gaa acc ttt ttt gtc gcc ttt cta tta tgt gtg gcg gcc ttt ata tta acg 

RETFFVAFIiLCVAAFIIiT 
ATT TTA CGT TTT CGG AAA ACA ACC AAA TAA 3 ' 
ILRFRKTTK* 
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5 « ATG GAA CAA AAA AAC OTA CGA TTA TTC CCT GCC GTT TTA GCG ACA GQA ATT ATG 

MEQKHVR LFPAVLATGIM 
TCT TTT GCT GGC GTA TTA ATT GAA ACA GCA ATG AAT GTA ACC TTT CCG ACA TTA 

SPAGVLIETAMNVTFPTIi 
ACA AAG GAA TTT GGT GTT TCC ACC GGT ACC GTG CAG TGG GTG ACA ACQ ATT TAT 

TKBFGVSTGTVQWVTTIY 
TTA TTA GTT ATT TCT ATC ATG GTG CCC TTA TCC AAT TAT TTA TTA AAA ACC TAT 

LLVISIMV PLSNYLLKTY 
TCG TTA AGA CGC TTA TTT ATT GTT GCG AAT CTT TTC TTT TTG ATT GGT TTA GCG 

SLRRLFIVANLFFLIGLA 
ATT GAT GTC TAT TCG CCA TCT TTC AGT ATC TTG TTA TTA GGC CGA CTC TTC CAA 

IDVYSPSFSIIiLLGRIiFQ 
GGG GCT AGC ACT GGG ATT GCT TTG CCA CTG ATG TTT CAT ATT ATT TTG AAC TTT 

GASTG IALPLMFHI I ZiNF 
ACC CCG CTG GAA AAA CGA KGA ACG ATG ATG GGG GTA SGC ACA TTG ACC ACT TCA 

TPLEKRXTMMGVXTLTTS 
ATT GCT CCA GCC ATT GGT CCA ACG TAT GGT GGA ATC TTA ACA TCT TCC CTG TCT 

IAPAIGPTYGGILTSSIiS 
TCG CAC GCC ATC TTT TTA TTC TTA ATT CCT ATT TTA CTG CTT TCT TTG TTT ATG 

WHAIFLFLIPIliLIiSIiFM 
GGC TTA TCT GCT ATT CCA GAA ATA CCT GTT AAA AAA ACC ACC ACT CTT GAT TTG 

GIiSAX PBIPVKKTTTIiDL 

GTT AGT TTA ATT GCT ATT GCC CTG TTA TTT AGT GGT TTG TTG ATG TTT TTA AGT 
VSLIGIALLFSGLLMFLS 

AAA ATT GGC ACA TTG TTT GGC TGG CTT TCA CTT CTG GCA GCT GTT ATT GGC TTT 
KlGTIiFGWLSLIiAAVIGF 

GTT ATT TTT TAT AAA CGA GCA ACG ACC GCT GAA CAG CCA CTA GTT CGT TTG ACA 
VlFYKRATTAEQPIiVRLT 

ATT TTA AAA AAT CCC GCC TTC GTT TTA TTT TTA TGT GGC TTT TTA GTT TGT CAA 
I LKNPAFVLFLCGFLVCQ 

TTC TTA TTG TTA GGT ATT TCC TTT GTC TTA CCA AAT TTT GTT CAA ATC GTC CTT 
FLLLGISFVLPNFVQIVL 
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5 ' ATG AAA AAC ACC CGA AAC ACC GTC CCA TCA TTA CTA ATT ATT ATT TTG CTT GTG 

MKNTRNTVPSLLIIILLV 
GGA TTT CCA CAA ATC AGC GAG TCG ATT TTC ACA CCC ATT TTG CCA GCG CTT AGT 

GFPQISESIPTPILPALS 
TCT GCT TTT AAC ATT CAC GAT AGC GTG GCA CAA TTA ACC ATG ACT ATT TAT TTT 

SAFNIHDSVAQLTMS IYF 
GTT GCC TTT GCC CTT GGC GTT TTA TTC TGG GGA ATT CTA TCC GAT ACG ATT GGA 

VAFAVGVL FWGILSDTIG 
CGT CGC CCA GCT ATG TTA TTA GGA ATT TTA TGC TAT TTA ATT GGA AAT ATT GGC 

RRPAMLIiGIliCYLIGNIG 
CTT TTT TTG AGT CCG ACA TTT CCT TGG CTG TTA TTC TTC CGT TTA ATT CAA GCA 

LFLSPTFPWLLFFRL IQA 
TTT GGT GCA AGC GTT GCT TCT GTC ATT ACC CAA TCG ATG ATG CGG GAA AGC TTT 

FGASVGSVITQSMMRESF 
AAC GAT CTT CAA AGA GGG AAA ATT TTT GCT CAA ACA AGC GCT GCC CTA TCG CTA 

NDLQRGKIFAQTSAALSIj 
TCT CCT GCC ATC GGA CCC TTA GTC GGC GGA CTT GTT CAA ACC TAT TTT GGC TAT 

SPAIGPLVGGLVQTYFGY 
CAG CAT GTC TTT AGC ACC TTA ATT GTC ATG GCT GTT CTA TTA CTT TTC TAT AGT 

QHVFSTLIVMAVIiIiI* FYS 
TTT TAT CGT TTA CCA GAA ACC TTT CAC TTA GGA AAT ACC TTA ACG AAA GCG CCT 

FYRLPETFHLGNTLTKAP 

TTT TGG TCC GTG GCG AAA CGC CTT TCC TTC GAC TCG AAA GTT TGG TGC TAC GGC 
FWSVAKRLSFDSKVW CYG 

TTA TTG ATT GGG GGC ATT AAT GGT TTA TTG TTT AGC TAC TAT GCA GAA GCT CCG 
LLIGGINGLIiFSYYABAP 

TTT ATT TTC ATC AAT CAC TTT GGC TTT AGC AGT GTC CAA TAT GGC TTT CTT GGT 
FIFINHFGFSSVQYGFLG 

TTA GTT TTA GCT GTG TCT AGT ATT CTT GCT GCG GTT CTT ATC AAT TGG TTG CTA 
LVLAVSSILAAVLINWLL 

GGA TAT TGG TCA GCG GAA AAA ATT GCC TTT GCT GGT TTG CTG CTA AGT ATC TTT 
GYWSABKIAFAGLLL S IF 
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GCG GCT TGC TGT TTA TTA ATA ATG ACA CAA TTT GAA ACA ATC ATT QGT,_.£JTTj ATT * 

AACCLLIMTQFETIIGLI 
TTG TTT ATT TTT CTC TTT TTC TTA GOG ATT AAT ATT ACT TTG CCC AAT GCC TTA 

LPIFLPPLGINITLPNAL 
AGT ATG GCC TTA AAA GGC TAT GAA TCT GTA ATT GGT ACT GCC AGT GGG ATT TTT. 

SMALKGYESVIGTAS GIF 
AGT TTT GTC TAT TAC TTG TTT GTT AGT TTC TTT ACC TAT TTG ATT AGT TAC TTT 

SFVYYLFVSFFTYLI SYF 
CAC AAT GGT ACC ATT TGG GTC TTG CCG CTT TAC TTT TTA AGT TTT GCT TGT CTG 

HNGTIWVI1PI1YFLSFACI1 
TTA GCT TTC AGC TAT TAT GTC ATT GTT TAT CGT AAA ACT TTA AAA TAA 3 ■ 

LAFSYYVIVYRKTLK* 
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5» GTG GGG CTT AAC ATT GTC TTA CCA CTC GTT AAT ATT ATT AAT GGA TTA GOT TGG 

VGLNIVLPLVKI inglgw 
ATG TTT GGC GTT GGC GGC GCT ACG TTG TTT TCG ACC ACG GTT GCG CAA AAA GAA 

MFGVGGATLFSTTVAQKB 
ATC AAA AAA GCC AAT CAG TAT TTT AGT TTA ACG ATT GGG TTA GTT TTC GTT ATA 

IKKANQYPSLTIGLVFVI 
GGG AGT CTC TTT ACT TTA GTG AGC TTG ATT TTT TCC GAT CAA ATT ATT CGG GGG 

GSLFTLVSLIFSDQI IRQ 
CTT CAG GGG ACC GOT GTC TTG TTT GGT TTG GCA AAG GAA TAT TAT ATG ATT TAC 

LQGTGVLPGIjAKBYYMIY 
CTT AGC TGT TCA TTA TTA TTT ATT TTA AAC AAT TGC CTG ATT ACT TTT TTA AGA 

LSCSLLFILNNCLITFLR 
AAT GAC CAT AAT CCT CGT TTG GCA ACG ATT GCT TTT GTC AGT GGG GGA ATT GTC 

NDHNPRLATIAFVSGGIV 
AAC ATT ATT TTG GAT TAT GTC TTT ATT TAT CAA TTT GGG TGG GGT ATG GCA GGT 

NIILDYVFIYQFGWGMAG 
GCG GCC ATT GCA ACG GTG ATG TCA CCG TTA ACA AGC TTA ATC CTG ATT ACG TTG 

AAIATVMSPI1TSI1ILITL 
CAT AAG TGG TCG CCG CAA CGA GTT TTA CGT TTT GAA AAG TTT AAA GTG AAA TTT 

HKWSPQRVLRFBKFKVKF 
CAG GAT GTT CGA GAA ATT ATG TCG ATT GGT TTT TCC TCC TTT TTA AAT GAA TTT 

QDVREIMSIGFSSFLNEF 
TCT TCG GCC TTT GTC ATG TTT TTA TTT AAT ATT GTT TTG TTA AAC TTA GTT GGT 
SSAFVMFLFNIVLLNLVG 

CAT GTG GCG ATT TCA GCT TAT GCG ATT GTT GCC AAT CTC AAT ATT ATT GTG ATT 
HVAI SAYAIVANLNI IVI 

GCT ATT TTT ACA GGG ATT GGG CAA GGG GCG CAA CCA TTA CTC AGC CGA TAC TAT 
AIFTGIGQGAQPLLSRYY 

GGT TTA GGG GAA ACG AAG GTG TTG CGT AAA TTT GTT AAA CTT AGC TTT ATT ACG 
GLGBTKVLRKFVKLS FIT 

TAC TTG GTC GCC GGC TTC CTG TTC TTT TTA ATC AGT CAA CTG TTT ACG GGA CAA 
YLVAGFLFFLISQVFTG.Q 
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ATT ATT GAG GTC TTT AAT AGC GAA GGG AAC GAC CAG TTG GCG CAA ATT, GCT AGG 



IIEVFNSEGNDQLAQIAR 
ACA GCC ATC CGC TTA TAT GCG ATT TCC TTC TTG TTT ACA GGA TTG AAT TTT ATG 



T A I R L 


YAISFLFTG 


L N F M 


GGG ATT TAT TAC TTT 


TCG GCA GTT CGT AAA CCG AAA ATG GCG 


CTA ATG ATT TCC 


G I Y Y F 


SAVRKPKMA 


L M I S 


TCT TTG CGT GGG TTC 


TTT TTA ATT GTG CCA GTT TTA TTT ATT 


TTG GTG AAA TTA 


S L R G F 


FLIVPVLFI 


L V K L 


CTA GGA TTA ACC GGT 


GTT TGG TTA GCC ATG CCA GTA GTT GAA 


TTT GTT ACG TTT 


L G L T G 


VWLAMPVVB 


F V T F 


GGA CTA ATG CTT GTG 


GGC TAC CTT GCG TAT CGA AAC TAT TTG AAA AAA AGA GAA 


G L M L V 


GYLAYRNYL 


K K R E 



GCA GTA ACC TGA 3 ' 
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5» ATG CCA TAT TTA TAT TTG TTA TTA GCA ATA GTC AGT GAA GTA ATA GGC**2flGT*GfeA*- 

MPYLYLLLAIVSEVIGSA 
TTT TTA AAA TCT TCA GAC GGT TTC TCA AAA TTA TAT CCA ACT ATA ACA ACA ATC 

FLKSSDGFSRLYPTITTI 
ATT TCA TAC CTT ATT TCT TTT TAT TTT CTG AGT AAA ACT ATG CAA CAT TTA CCA 

ISYLISPYFIiSKTMQHLP 
CTT AAT ATT GCT TAC GCA AGT TGG TCA GGT TTA GGA TTA GTA TTA ACA ACA ATT 

LNIAYASWSGLGLVLTTI 
GTT TCA GTT CTT ATT TTC AAA GAA CAA ATA AAT TTA ATA AGC ATT ATT TCA ATT 

VSVIiIFKEQINLISIISI 
ATT TTA ATA ATA TTT GGT GTT GTT TTA CTA AAC ACA TTC GGA TCA TCA CAC TAA 3 ■ 

ILI I FGVVLIiNTFGSS H* 
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